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Abstract 



The predictions of quantum mechanics cannot be resolved with a completely 
classical view of the world. In particular, the statistics of space-like separated 
measurements on entangled quantum systems violate a Bell inequality [Belll964] . 

We put forward a computational perspective on a broad class of Bell tests that 
study correlators, or the statistics of joint measurement outcomes. We associate 
particular maps, or functions to particular theories. The violation of a Bell 
inequality then implies the ability to perform some functions, or computations 
that classical, or more generally, local hidden variable (LHV) theories cannot. 

We derive an infinite class of Bell inequalities that establish a link to so- 
called "non-local games" |Cleve2004] . We then make the connection between 
Raussendorf and Briegel's formulation of Measurement-based Quantum Com- 
puting (MBQC) |Raussendorf2001j . and these non-local games. Not only can we 
show that a quantum violation implies a computational advantage in this model, 
we show that adaptive measurements are required to perform all quantum com- 
putations. 

Finally, we explore post-selection of data in Bell tests from both a practical and 
conceptual point-of-view, with particular consideration to so-called "loopholes". 
Loopholes allow LHV theories to simulate quantum correlations through post- 
selection. We give a computational description of how loopholes can emerge 
in different post-selection scenarios. This motivates us to find a form of post- 
selection that does not lead to loopholes. Central again to this discussion is the 
description of LHV theories in terms of computations. 

Interestingly, quantum correlators can be made more "non-classical" with this 
loophole-free post-selection. This method of post-selection also can simulate 
information processing tasks, such as MBQC, that have time-like separated com- 
ponents. This opens up new avenues for the study of time-like tasks studied 
within the space- like separated scenario of the Bell test. 
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1 Introductions 



"/ tell you, we are here on Earth to fart around, and don't let anybody tell you 
different. " 

-Kurt Vonnegut 

If this thesis has one central motivation it is this: to explore the interplay between 
the foundations and applications of quantum physics. The emergence of quan- 
tum information (the application of computer science ideas to quantum physics 
|Nielsen2000j ) has motivated new insights into quantum mechanics. Indeed, new 
interpretations of quantum physics have been influenced by information theoretic 
concepts (e.g. |Caves2002j ). In turn, ideas in quantum foundations have inspired 
new technological ideas and applications (e.g. [EkertlOOTj IWoottersl982] ). The 
hope is that this work contributes to this fertile area of research by considering 
quantum mechanical correlations from a computational point-of-view. 

In discussing the interplay between computation and correlations (in particu- 
lar correlations of measurement statistics) , we will discuss issues central to both 
computer science and quantum theory. Before we can address these issues we 
need to introduce basic concepts in quantum mechanics and quantum informa- 
tion. We will also mention how ideas in the foundations of quantum mechanics 
have inspired new applications of quantum theory, with a particular focus on the 
Bell inequality |Belll964j . 

First, we introduce quantum mechanics and discuss the concept of entangle- 
ment |Schrodingerl936l . Einstein, Podolsky and Rosen used entanglement to 
argue that quantum mechanics is an incomplete theory |EPR1935] . This leads 
us to discuss Bell's argument that quantum mechanics is incompatible with "local 
realism" |Belll964] . This incompatibility is epitomised by a violation of a Bell 
inequality |Beni964[ [n"HSH1969j . 

After the above discussion, we give a brief overview of quantum information 
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science. We indicate that entanglement has been shown to be a resource in 
quantum information |Nielsen2000| . The incompatibility of quantum mechanics 
with local realism is also a resource for certain tasks: device-independent quan- 
tum information protocols |Mayers98l IAcm200"7t IPironio2010j . We finish by dis- 
cussing possible connections between Bell inequalities and Measurement-based 
Quantum Computing |Raussendorf200T] : the latter utilises entangled states to 
perform computational tasks. All of the work in this chapter is introductory and 
does not consist of new results produced by the author of this thesis. 



1.1 Quantum Mechanics and Entanglement 

In this section, we give a brief overview of the postulates of quantum mechanics. 
We also look at one of the consequences of these postulates: entanglement. There 
are very many clear and excellent pedagogical introductions to the quantum 
formalism (e.g. jPeresl993l, INielsen2000| ). We base our introduction on that of 
Nielsen and Chuang |Nielsen2000] . The more relevant aspects of quantum theory 
will be emphasized, especially with regards to measurements. 

Quantum mechanics is a mathematical framework for making predictions of 
outcomes of experiments. The problem of how this framework relates to a picture 
of physical reality is still open. An interesting research direction is to recover the 
quantum formalism from a set of axioms rooted in less mathematical, or more 
physical principles (e.g. |Hardy2001 , IChiribella2011| ). This subject will not be 



addressed in this thesis as it would be too much of a diversion from our discussion. 
Although, the issue of realism in a limited form will be encountered in section 

1.1.1 Postulates of Quantum Mechanics 

In this subsection, we assume familiarity with linear algebra, complex vector 
spaces and Dirac notation ( |Nielsen2000j is an excellent reference for these sub- 
jects). Physical systems described by quantum mechanics are associated with 
a complex inner product vector space, or Hilbert space T-L. This idea can be 
formalised in the following postulate taken verbatim from |Nielsen2000] . 

Postulate 1 |Nielsen2000] . Associated to any isolated physical system is a com- 
plex vector space with inner product (that is, a Hilbert space) known as the state 
space of the system. The system is completely described by its state vector, which 
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is a unit vector in the system's state space. 



A unit vector 1-0) in this Hilbert space Ti must satisfy {ip\ip) = 1, where {ip\ is 
the dual vector to in the dual Hilbert space Ti* . For two-dimensional Hilbert 
spaces, all unit vectors are called "qubits" (quantum bits) and can be written as 
\ip) = a\0) + (3\1), where a and (3 are complex numbers satisfying |ap + |/3p = 1. 
By convention we choose the basis states in a d-dimensional Hilbert space Ti 
(where d is finite) to be \j) where j S {0, 1, {d — 1)]Q 

More generally, quantum states can be associated with "density matrices" p, 
or an element of the space L(T-L) of linear operators on T-L. We may need to 
consider density matrices for physical systems that are not isolated or when an 
experimenter is not sure which state |0) a system is in; they assign probabilities 
to the possibilities. These density matrices represent statistical ensembles of 
the unit vectors described by Postulate 1. The unit vectors \ip) are associated 
with "pure states" that are the density matrices p = \xp){ip\. If p is a statistical 
ensemble of pure states \^pj) then it can be represented as 



where j labels all possible pure states in an ensemble. The probabilities pj are 
associated with each pure state {ipj) where J2jPj — ^ Pj — ^■ 

For density matrices, the inner product is generalised to the operator trace 
Tr(...) such that Tr{p) = J2j Pj'^^{\^j){''Pj\) = due to the cyclicity of trace. 
This is one of the conditions that a density matrix must satisfy along with the 
positivity condition p > 0. This second condition is satisfied for any arbitrary 



state \^) en as (</.|H0) = EiPjM^j)\^ > due to MiPj)\^ = {cP\iPj){i,j\^). 



The second postulate describes how quantum states can be transformed over 
time. Again this and all postulates are reproduced verbatim from |Nielsen2000| . 

Postulate 2 |Nielsen2000j . The evolution of a closed system is described by a 
unitary transformation. That is, the state {ip) of the system at time ti is related 
to the state {tp') of the system at time t2 by a unitary operator U which depends 
only on the times ti and t2, 



^The set of integers {0, 1, (d — 1)} can be described in terms of the cyclic group Z^j. 
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We immediately see that a unitary operator preserves normalisation of a state 
as {ip'\ip') = {tplU^Ultp) = {ip\ip) = 1 where W is the adjoint of U so that 
U^U = 1, the identity matrix. Unitary operators can also be applied to a density 
matrix as 

UpU^ = Y,P,U\^P,){^P,\U^ = Y,Pj\i^j){i^j\- (1-3) 
j j 

For open systems (i.e. systems that are not closed) we can generalise the unitary 
operator to a linear operator that must be completely positive and not increase 
the trace of p. The next postulate of quantum mechanics relates to measurements 
which are a form of completely positive and non-trace- increasing linear operator. 

Postulate 3 |Nielsen2000| . Quantum measurements are described by the collec- 
tion {Mm} of measurement operators. These are operators acting on the state 
space of the system being measured. The index m refers to the measurement out- 
comes that may occur in the experiment. If the state of the quantum system is 
\tl)) immediately before the measurement then the probability that result m occurs 
is given by 

p{m) = {MMlMM, (1.4) 
and the state of the system after the measurement is 

M.ra\tlj) 



\MlMm\ 



(1.5) 



The measurement operators satisfy the completeness equation, 

Y,MlMm = l. (1.6) 

m 

Again, the above postulate can be extended to density matrices p where p{m) 
becomes p{m) = Th:{pMmMm) and the state of the system after measurement is 
now 



Ti{pMlMmj 

Therefore measurement operators Mm act on density matrices in an analogous 
fashion to unitary operators. In calculating the probabilities of particular out- 
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comes and satisfying the completeness relation, Mm and Mm always appear to- 
gether. For probabilities of measurement outcomes we rewrite MmMm as an 
operator Em associated with a measurement outcome m. The operator Em is 
called an "element" of a Positive Operator- Valued Measure (POVM) and is a 
positive operatoij^ such that Ylm^rn = I with probabilities p{m) = Tr{pEm)- 
The set of operators {Em} is then a POVM. 

A special case of all possible measurements is the von Neumann projective 
measurement (PVM). This is the set {Pm} where each element Pm is a projector 
associated with a measurement outcome m. These projectors satisfy an orthog- 
onality constraint PmPm' = ^m'Pm and for an arbitrary pure state \ip), the state 
after a PVM is oc Pml'ip)- Then, to satisfy the orthogonality constraint, we 
need A'' orthogonal vectors li^m) to describe the projectors Pm = \i^m){i^m\, where 

is the number of possible outcomes m of a measurement. 

A PVM can be associated with an "observable" which matches each projec- 
tor Pm of a PVM with a real eigenvalue Am- This observable O can be written 
as O = J2m ^mPm where Am is an observed outcome. The eigenvalue Am cor- 
responds to a system being projected into the eigenstate \fJ,m) associated with 
-Prrjfl For example, for a two-dimensional Hilbert space, we can have observables 
with eigenvalues Am = ±1 associated with two-dimensional vectors |/^m) where 
m takes two possible values. 

There is a beautiful result due to Naimark that shows that any POVM on 
a quantum state can be associated with a PVM |Paulsen2003j . That is, every 
POVM acting on a Hilbert space Ti can be implemented with a PVM on a larger 
Hilbert space /C. We can obtain some auxiliary (often referred to as an ancilla) 
system and take the composite of this system and our original Hilbert space T-L 
and perform a PVM on this new space. In order to consider composite systems 
we need to introduce the next postulate. 

Postulate 4 |Nielsen2000j . The state space of a composite physical system is the 
tensor product of the state spaces of the component physical systems. Moreover, 
if we have systems numbered 1 through n, and system number i is prepared in the 
state {i/ji), then the joint state of the total system is {ipi) \ip2) (X" ... ^ \tpn)- 



■^For all choices of states 10), {4>\Em\(j>) is a probability by definition, so Em is a positive 
operator. 

^These are eigenvalues and eigenstates as 0|/im) ~ ^m\l^m)- 
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We can replace pure states IV^i) in this postulate with density matrices pi. 
Composite systems can be represented by density matrices as linear operators 
on a tensor product Hilbert space, i.e. p G L{T-ii 'H2 ® ■■■ ® Tin) where Tii is 
the Hilbert space of each ith system. While in the postulate, we mention one 
pure state, 'S)7=i iV'i) = iV'i) {"^2) ■■■ CS> \ipn), in particular, this is not the most 
general pure state in a composite Hilbert space (S)7=i ^* ~ ^1 ^2 (X" ... (8) Tin- 
The state 'S)7=i iV'j) is a "product state", but pure states that cannot be expressed 
in this form are said to be "entangled". This property will be discussed in the 
next subsection. 

We have given a brief overview of the mathematical construction of quantum 
mechanics. In this thesis, we will be utilising the definition of a measurement and 
the description of composite systems. If a composite system consists of two space- 
like separated systems Tii and T-L2, then experimenters in each of these space- 
like separated systems can perform measurements on each of their respective 
subsystems. This way measurements can be written as a tensor product of these 
localised measurements, i.e. A^^(8>A1^ where Ailn ^ L,{'Hi), a linear operator on 
Tii. Assume that one can prepare all possible states (by whatever means) on the 
composite system, i.e. p £ L{T-Li 0'H2)- If the measurements are performed on 
entangled states then the statistics produced by this total system do not always 
factor ise, i.e. 

p{m,m') = Tr{pMl ^ M^,) + ^p,Tr(pi,,.M^)Tr(p2,iA^^0 (1-8) 

i 

as p is not necessarily equal to J2jPjPi,j ® P2J where pij corresponds to a pure 
state of the ith system in the jth term of the decomposition of p. 

This inability for the statistics of space-like separated measurements to be 
factorised will be central to the discussion of quantum correlations in this thesis. 
Entanglement is central to this subject. In the next subsection we will briefly 
discuss entanglement and how it can be quantified. 

1.1.2 Entanglement 

Schrodinger first introduced the term "entanglement" |Schr6dingerl936] . This 
concept has become formalised for all possible density matrices p. First we 
describe systems in a bipartite scenario, that is where the Hilbert space of the 
system in question is the tensor product of two Hilbert spaces. An entangled 
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state represented by a density matrix p cannot be expressed as 



p = J]KlV'->(V''|€D|V^f)(Vf|. (1.9) 

i 

The pure state I is the jth party's state for the ith pure state in the 

probabiHstic ensemble of p. There may be multiple, even infinite possible de- 
compositions of p into a convex combination of pure states ® 
For example, the density matrix p = |I in a composite Hilbert space of two, 
two-dimensional Hilbert spaces can be written as 

p = J(|00)(00| + |01)(01| + |10)(10| + |ll)(ll|) 

= ^(l++>(++l + l + ->(+- l + l-+)(- + l + l --)(-- 1), 

(1.10) 

where \+) = ^(|0) + |1)) and |-) = ;^(|0) - This multiplicity of de- 

composition makes it difficult to ascertain whether an arbitrary density matrix 
is entangled or otherwise. 

If a density matrix is a bipartite pure state, then there is a definite method 
to detect whether this state is entangled or not |Popescul997 lPrenio2007] . This 



method of detection also can quantify the amount of entanglement. For mixed 
states, this detection is a hard problem to compute |Gurvits2002j . 

The method of detecting entanglement for bipartite pure states involves finding 
the "Entropy of Entanglement" |Popescul997, . To calculate this quantity, first 



one needs to find the reduced density matrix of pi and p2 corresponding to party 
1 and 2. The reduced density matrix is calculated from the partial trace of p, 
where we only take a trace over one party's system instead of the whole composite 
system. The partial trace of p over system 1 of two systems is written as Txi{p) 
and is calculated as 

Tvi{p) = Y,{h\p\ii), (1.11) 

i 

where \ii) are basis states on system 1. Without loss of generality, we assume that 
all subsystems have the same dimensional Hilbert space. We then calculate the 
von Neumann entropy S{pi) |vonNeumannl955] of this reduced density matrij^ 



*We make the standard abbreviation of omitting the tensor product for composite pure states, 

e.g. |0) ® |0) becomes |00). 
^The von Neumann entropy is the same for either sub-system [Nielsen2000] . 
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pi- 

5(pi) = -TV(/,ilog2(pi)). (1.12) 

If S{pi) = 0, then the reduced state pi is a pure state and so p = pi® p2 with 
both pj being pure states. Importantly if S{pi) > then the pure state p is 
entangled. For S{pi) = 1, then pi = ^I. The state p that results in S{pi) = 1 is 
the "maximally entangled state" of two qubits, as it gives the maximum value of 
S{pi) for two qubits. The maximally entangled state |^') of two d-dimensional 
systems can be written as 

, (d-i) 

If we take the partial trace over system 1, then 

(d-i) /(d-i) 

i=0 \ j=0 

1 ^'"'^ 1 

j=0 

In the case of two qubits we retrieve the value of entropy mentioned above, but 
in general, for these states S{p2) = S{pi) = log2((i). 

We have only discussed the bipartite case. In this thesis, we will also be 
interested in multipartite quantum systems. The definition of an entangled mul- 




tipartite state is now where an entangled state cannot be written as (1.9) but 
with \'il^j){ipj\ <8> \ now replaced with (S)j=i iV'i I- Entanglement of 

multipartite systems is relatively less well-studied but there do exist measures of 
entanglement in this scenario [Plenio2007] . There is also not one particular max- 
imally entangled state for the multipartite setting like there is for the bipartite 
setting. 

So far entanglement has been discussed as a mathematical construct and we 
have not discussed its physical consequences. In the next section we will discuss 
the impact of entanglement upon the foundations of quantum mechanics. That 
is, it causes a tension between quantum physics and a classical physics view of the 
world |Bell2004] . If quantum mechanics describes what is actually happening in 
the world then we need to accept some behaviour that is potentially incompatible 
with everyday intuition. We will make these issues more rigorous in the next 
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section. 



1.2 EPR Paradox and Bell Inequalities 

Albert Einstein played a crucial role in the development of quantum theory 
|Einsteinl905j . However, upon being developed formally, he was famously dissat- 
isfied with it. At its core, quantum mechanics predicts probabilities, and does not 
always make deterministic prediction^ It could be argued that this probabilistic 
feature convinced Einstein that quantum mechanics was a statistical theory akin 
to classical statistical, or Liouvillian mechanics |Liouvillel838] . In Liouvillian 
mechanics objects have defined positions and momentum, but we may not have 
complete knowledge of these properties . Therefore, a probability distribution is 
assigned over a space of potential properties of a system. The state could 
also resemble a probability distribution over some underlying reality describing 
a system. For more discussion of Einstein's potential view of quantum physics, 



see work by Harrigan and Spekkens Harrigan2011|. 



A particular focus for Einstein's criticism of quantum mechanics became the 
issue of "locality". Locality has many different guises but we heuristically use it 
here in the sense that events in space-time can only "affect" each other if they 
are within each other's light-cone. It has been suggested by Bacciagaluppi and 
Valentini that Einstein had an argument against quantum theory based on a 
violation of locality at the 1927 Solvay Conference |Bacciag aluppi2009| . This 
discussion is beyond the scope of this thesis but we only mention it as a prelude 
to the argument presented by Einstein, Podolsky and Rosen (EPR) |EPR1935] . 
often called the "EPR paradox']^ 

1.2.1 Realism and "Incompleteness" of Quantum Mechanics 

In the original EPR paper, they argued that if one can predict a physical property, 
or quantity, with certainty then we associate that quantity with an "element of 
reality" |EPR1935i . If by the definition of EPR, a theory is "complete" then the 
properties that are found with certainty must be incorporated into the theory 
describing the system. Take two observables Oi and O2 that do not commute. 



^Einstein's dissatisfaction can be summarised with one of his famous playful quotes: "... 

He[God] does not throw dice." |Ein stcinl971 
'^The paradox being that if one accepts a particular picture of reality, then quantum mechanics 

contradicts this picture. It is not a paradox in the sense of demonstrating that quantum 

mechanics is inconsistent. 
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i.e. [Oi,02] 7^ 0, and a state \'4j) being an eigenstate of Oi (with eigenvalue 
A). If we make the constraint that the two observables do not share eigenstates 
nor are any of the eigenstates of one observable orthogonal to eigenstates of 
the other. We can predict the outcome A of observable Oi with certainty, but 
cannot predict the outcome of O2 with certaint}|^ This means that we can only 
associate the observable Oi with an element of reality but not both observables. 
The following contradiction emerges if one asserts that elements of reality can 
only be associated with commuting observables. We follow Bohm's version of the 
EPR argument |Bohml951| . 

Imagine that two parties share the entangled state (that is equivalent to the 
maximally entangled stat^: 

l^>EPR = ^(|01)-|10)), (1.15) 

such that one party has access to one of the two-dimensional subsystems, or 
qubit, and the other party has access to the other qubit. We have put no 
constraint on the distance between the two parties, and in fact we make them 
space-like separated. The first party makes measurements of the observables 
^ = (l+)(+|-|-)(-|)orZ=(|0)(0|-|l)(l| 



- 10 



Each observable is associated 



with outcomes, or eigenvalues ±1 and projectors Pm associated with this eigen- 
value. According to EPR because the parties are space-like separated they can 
no longer "interact", and regardless of the observable performed by the first party, 
we must assign the same elements of reality to the second party |EPR1935] . 

If party 1 measures X and gets +1 or —1 then the second party's state will 
be |— ) or 1+) respectively with certainty (upto a global phase). Since we can 
predict the second party's state with certainty we must assign this property with 
an element of reality. If, on the other hand, party 1 measures Z then for outcomes 
+1 or —1 the second party's state will be |1) or |0) respectively (upto a global 
phase). Again, we can assign an element of reality since after the measurement, 
the first party knows the second party's state with certainty. 

To summarise, if party 1 measures X, then we can assign an element of reality 
with the second party's observable X. When party 1 measures we assign an 



*If we make a measurement of the observable Oi on ]?/;), we obtain A so that the projection 
Pm = \ip){tp\ has been performed on \ip), giving the probability p(m) — {^{ip) {tp\tjj) — 1. 
However, O2 consists of projectors Pm = \(t>){4'\ where It/;) 7^ so for O2 

p{m) = (mm) + 1- 

®One applies the unitary I ® 17 such that U = jO)(l| — ll)(0| to both qubits. 
^°These are the Pauli-X and Pauli-Z measurements respectively. 
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element of reality with the second party's observable Z. Since the measurements 
performed by party 1 are space- like separated from party 2, the elements of reality 
for party 2 should not be affected by the first party's measurements. This is the 
locality argument in the EPR paradox. However, X and Z do not commute, so we 
cannot assign an element of reality to each observable arriving at a contradiction. 
EPR reasoned that this contradiction means that quantum mechanics does not 
result in a complete picture of reality |EPR1935] . 

John Bell formalised the language of the EPR paradox away from the discussion 
of "incompleteness" and "elements of reality" into more mathematically rigorous 
concepts |Belll964l IBell2004] . He showed that the assumption upon which the 
EPR paradox is based is that all physical systems obey "local realism" [Bell2004] . 
Local realism combines two separate assumptions invoking locality and realism 
and can be seen to limit the statistics of space- like separated measurements. In 
the following subsection we will briefly review local realism and show that it puts 
constraints on these statistics. 

1.2.2 CHSH Inequality 

We will describe local realism mathematically in section |2.1| of the next chap- 
ter but for now, we review the work of Clauser-Horne-Shimony-Holt (CHSH) 
ICHSH1969] . The seminal work of Beh |Beni964j led to the formulation of the 
Bell inequality. This work was developed by CHSH into a mathematical expres- 
sion that can be experimentally testable: the CHSH inequality. 

We now describe the Bell-CHSH scenario, or "test" |CHSH1969j . There are 
two parties and each party chooses between two measurements. The choice of 
measurement is a completely random, free choice of the parties. This is a key 
assumption in the construction of Bell inequalities |Bell2004] (for consequences 
of dropping this assumption see |Barrett201H IHall201l] ) . Each measurement has 
two possible outcomes ±1. The measurements that the jth party chooses from 
are A^^ and A^j. These measurements can be described by an arbitrary theory 
and not just quantum theory. The statistics in this experiment that will be of 
interest to us are the correlations of the form 

¥.{M\Mi) = viMlMi = 1) - v{M\Mi = -1), (1.16) 

the expectation values of the joint outcome of both parties' measurements for 
choices fc. A;' S {0, 1} where "p^M^M^ = ±1) is the probability of getting the 
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joint measurement outcome ±1. 

There is actuahy a class of CHSH inequalities for this scenario |Finel982] , but 
we just pick out one particular expression 

E(7W?7W^) + EiM'iMl) + E(7W}7W^) - E{M\mI) < 2, (1.17) 

where the upper bound of 2 is satisfied for all physical systems that satisfy local 
realism |CHSH1969j . Local realism means outcomes of A4j are only dependent 
on some set of objective properties of each party's system. Secondly, these prop- 
erties (which can be seen as elements of reality) are localised to each space-like 
separated region. Whilst they may have been shared properties when parties 
were not separated in the past, they are not affected by anything outside of their 
region. A locally realistic property for measurement is then £ {=^1}, so 



we can write the left-hand-side of (1.17) as 



nx'iixl + Xl) + Xlixl - XD) = Y.Px ix'iixl + Xl) + X\ix'2 - XD) (1.18) 

X 

where is a probability distribution over all possible assignments of Xj to 
measurements such that '^^Px = 1 and p^. > 0. By convexity we can upper 
bound the right-hand-side of ( |1.18| ) by just considering the maximum value of 

Xi{X2 + xl) + Xi(X2 - xD- If iX2 - xl) is non-zero then {x2 + xl) will be zero, 
resulting in 

Y.PX + xl) + XW2 - xD) < 2. (1.19) 



This expression then gives exactly the same right-hand-side of (1.17). 

This result is interesting as we can derive a consequence of a theory with very 
few prior assumptions. More importantly though, in the following theorem, we 



can actually say something about quantum theory using the expression in (1.17) 



Bell's Theorem [Belll964] : The predictions of quantum mechanics are not com- 
patible with a locally realistic theory. 



Proof: To prove this theorem, we just need to show that the inequality (1.17) 
is not satisfied for all predicted values of E(A1^Al2') in quantum theory. We 
prove this by example. If two space-like separated parties share the state |\I')epr 
and make the measurements Aii = X, = Z, = :^(~^ ~ ^)) ™d 
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Ml = ^{Z - X), then the left-hand-side of ([TTt]) is 



J X(^{-Z-X) ^ X 



(Z-X) Z®[-Z-X) Z®[Z-X) 



V2 



V2 



+ 



V2 



V2 



(1.20) 



We have used the short-hand notation ((...)) = (V"! where = |^')epr- 
Calculation of the right-hand-side of ( |1.20[ ) yields a value of 2\/2, which is greater 
than 2, thus violating the CHSH inequalitjj^ Therefore quantum mechanics is 
incompatible with a theory satisfying local realism. □ 



The simplicity of the theorem and its proof has remarkable implications for 
the foundations of quantum mechanics. It means we must abandon the intu- 
ition of local realism, a constraint satisfied by classical physical systems. If the 
predictions of quantum theory are experimentally verified then if measurement 
outcomes result from elements of reality, then this reality does not satisfy local- 
ity. Or we could just abandon realism all together and not have to worry about 
locality. 



1.2.3 Geometric Construction of Bell Inequalities 

Beginning with the work of Froissart |Froissartl981j , then developments by Fine 
|Finel982] , Pitowsky |Pitowskyl989| and Peres [Peres 1999] , the geometric picture 
of Bell inequalities has been well-developed. Correlations of space-like separated 
measurements are now elements of a vector in some real space. The space of 
correlations satisfying local realism is a convex polytope which can be described 
in terms of linear inequalities |Grunbaum2003| . These linear inequalities are 
examples of Bell inequalities. Finding these inequalities is then a problem in 
convex geometry. 

This polytope approach to Bell inequalities is now an effective way of un- 
derstanding the consequences of local realism. We will elaborate on and de- 



scribe this approach in section 2.1.3 of the next chapter. Also we will com- 
ment on the hardness of finding the Bell inequalities that define the polytope 
of locally realistic correlations. The convex geometric approach has also been 



^This value of 2^2 is known as Tsirelson's bound [Tsirelsonl980) as it is the largest possible 



quantum value of the left-hand-side of ( 1.20 1 
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extended to the study of correlations that satisfy only a form of locality: space- 
like separated measurements that do not allow instantaneous communication 
|Barrett2005b| lP"ironio20l"T] . These issues will be discussed in section 2.3 



1.2.4 Experimental implementations for testing local realism 

Testing whether quantum mechanics violates a Bell inequality in the laboratory 
is a difficult task. Firstly, measurements have to be space-like separated but 
transporting fragile quantum states over large distances can be hard. States may 
interact with the environment and become mixed states that are no longer en- 
tangled. Secondly, apparatus in the lab is not perfect and detectors may not 
always perfectly detect a measurement outcome. These difficulties can lead 
to "loopholes" (as we shall discuss in section 4.1 of chapter |4]) whereby locally 



realistic theories are no longer constrained by the Bell inequality being tested 
|Pearlel97n[ |Gargl987 . 



If we do not have space-like separated measurements then the local aspect of 
locally realistic theories is not constrained and we have the "locality loophole". 
For imperfect detection, the associated "detection loophole" is more subtle as it 
allows the possibility that the objective properties of a system can describe the 
statistics of detection |Pearlel970j . If we make the extra assumption that prop- 
erties of the system we are observing are independent of the detection system, 
often called the "fair-sampling assumption" |Clauserl978] . then violations of a 
Bell inequality have been observed in photonic systems |Aspectl981 IWeihsl998j . 



Without this extra assumption, then ion-based systems have got around the de- 
tection loophole but suffer from the locality loophole |Rowe200T] . At the time of 
writing this thesis, completely loophole-free Bell inequality violations have not 
been observed. Although, there are promising avenues for future experimental 
work |Matsukevich2nn8[ IVertesi2ninj . In chapter [Ij we will give a more thorough 
discussion of loopholes in Bell tests. 



1.2.5 The GHZ Paradox 

Bell's theorem can be proven using the now-famous Bell inequality. Did we need 
to construct this expression? There have been several arguments which have 
shown that quantum mechanics is incompatible with local realism but with- 
out use of a Bell inequality. For example, in 1983, Heywood and Redhead 
|Heywoodl983| developed a proof that local realism cannot be compatible with 
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the statistics of two space-like separated, yet entangled spin-1 systems. This proof 
relied on an argument of determinism, in the spirit of the original EPR argument 
[EPR1935] . Later in 1989, Greenberger, Horne, and Zeilinger (GHZ) developed a 
proof of Bell's theorem without inequalities for three space-like separated parties 
|GHZ1989] . The GHZ argument has subsequently been developed by Mermirp^ 
|Merminl990t IMerminl993j . Another notable example of Bell's theorem without 
the inequality is "Hardy's Paradox" which can be seen as a "possibilistic" proof, 
i.e. some things are possible in quantum mechanics that are not possible with 
locally realistic theories |Hardyl993| . We now present the GHZ argument, or 
"GHZ paradox" to which it is often referred, as a simple and beautiful proof of 
Bell's theorem. 

We have three, space-like separated parties who (like in the CHSH construc- 
tion) each have a completely free choice of measurement from a set of two mea- 
surements. We label the two measurements for the jth site Ai^ and Ai^ and 
each measurement takes one of two possible outcomes ±1. As with the CHSH 
construction, each outcome is then a result of some objective property of each 
party's local system (which may have been shared in the past). Therefore, each 
measurement Aij again is assigned the value Xj £ {=^1}. Again we are interested 
in the correlations E(A^^A^2-^3^) where k, I, m ^ {0,1}. If we now obtain the 
following deterministic correlations for a particular set of measurements 

E{M^^MlMl) = -1 (1.21) 
E{M^^MlMl) = -1 (1.22) 
E{MlMlMl) = -1 (1.23) 

then we can assign values of deterministically to E(A^^A^2-A4™) = XiX2X3^- 



we multiply rows (|1.21|), (|1.22|), and (|1.23|) together after they have been assigned 
values of Xj 
must obtain 



values of Xj and observe that (Xj)^ = Ij then in a locally realistic theory, we 



E{M\MlMl) = xlxlxi = -1. (1.24) 



However, measurements on an entangled quantum state can satisfy ( 1.21 ), ( 1.22 ), 



and (1.23) but contradict (1.24). The entangled state consists of three qubits 



|^)ghz = I (-100+) + |01-) + |10-) + |11+)) , (1.25) 



^^This argument was a development of a proof that quantum mechanics is "contextual" by 
Asher Peres developed into a proof of Bell's theorem. 
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where each j'th site has one of these qubits and performs the measurements 
Ai^ = X oi = Z. Calculating all expectation values, the statistics from 



these measurements on the state |^')ghz satisfy correlations in (1.21), (1.22), 



and (1.23). However, 



¥.{M\mImI) = {Z (g) Z (g)X) = +1, 



(1.26) 



thus contradicting (1.24). These quantum correlations have deterministically 



shown that local realism is inconsistent with quantum mechanics. 

Mermin showed that we can still construct a Bell inequality from the correla- 
tions of the GHZ argument |Merminl990] . We construct the following inequality 



-E{M'^^M°2Ml)-E{M°iMlMl)-E{MlMlMl)+E{MlMlMl) < 2, (1.27) 



which the correlations in (1.21), (1.22), (1.23) and (1.24) satisfy. Mermin showed 



that this inequality is satisfied for all locally realistic theories |Merminl990j . 
but the quantum mechanical correlations described above give a value of 4 for 



the left-hand-side of (1.27). We will show in section 1.4 that the GHZ-Mermin 



argument against local realism in quantum physics will be relevant to discussion 
about quantum information. 

The CHSH inequality and the GHZ argument are ways of putting constraints 
on what is possible in a classical, or more generally, a locally realistic theory. The 
fact that quantum mechanics predicts contradictions to both constraints gives a 
remarkable departure from a classical view of the world. It indicates that when 
we are utilising the quantum mechanical formalism we can produce non-classical 
phenomena. One of the most enticing prospects for quantum mechanics is to use 
non-classical behaviour to perform some useful task that we could not achieve 
with classical resources. This motivation has led to the relatively nascent field 
of "quantum information science" |Nielsen2000| . One of the goals of this field 
is to process information via computation or communication and use quantum 
mechanical systems to do this "better" than with classical resources. In the next 
section we will give a broad overview of the field and how quantum systems could 
out-perform classical systems. 
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1.3 Quantum Information Processing 



We have seen how quantum physics can be seen as non-classical in some con- 
crete sense. Quantum information science has been developed to answer whether 
the non-classicality of quantum physics can be used to perform information 
processing tasks thought difficult or intractable with classical physical systems 
|Nielsen2000j . We now give a broad, and incomplete, overview of the field of 
quantum information in order to show that quantum resources can be useful for 
information processing. 

The history of quantum information is itself an interesting topic for discus- 
sion. Stephen Wiesner developed the idea of "conjugate coding" circa 1970 but 
the result was not published until the 1980s [W iesnerl983] : this idea went on 
to infiuence the field of quantum cryptography. Alexander Holevo published his 
famous theorem in 1973 limiting the classical information in, say, a qubit to be- 
ing at most one classical bit |Holevol973j . Holevo's theorem is one of the most 
significant results in information theory applied to quantum systems and quan- 



tum "channels' The idea of a "quantum computer", or some quantum system 
capable of performing computations was first suggested by Richard Feynman in 
1982 |Feynmanl982| ; the work of David Deutsch later formalised this concept 
|Deutschl985| . Wootters and Zurek showed that unknown quantum informa- 
tion cannot be copied, called the "no-cloning" theorem |Woottersl982j . In the 
light of all of this work, we begin our discussion in the next subsection in 1984, 
with the seminal work by Bennett and Brassard (BB) on quantum cryptography 
[BB1984] . This work by BB brought together the ideas of the no-cloning theorem 
and conjugate cloning in a simple yet powerful way. 



1.3.1 Quantum Cryptography 

Two parties, referred to as Alice and Botj^want to communicate to each other 
without fear of eavesdroppers intercepting their messages. Alice encodes her 
message into another message or "ciphertext" with a "key" that Bob knows but 
no-one else does. Bob can use the "key" to unlock Alice's message from the 



Quantum channels consist of the positive hnear operators on some "input" quantum state, 

mapping this state to another state. Perhaps this channel is a perfect communication channel 

for qubits and so would be the identity operator I. 
These two characters have a long and auspicious career in computer science. Such is their 

success that the quantum information community talk often of Alice and Bob in quantum 

information procedures. 
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ciphertext. An eavesdropper can try and guess or calculate the key, but if it 
is random and Alice applies the "one-time pad", then a message can be made 
perfectly secure as defined by Shannon |Shannonl949j . The one-time pad consists 
of one bit of a message x G {0, 1} being added (modulo 2) to a random bit 
r G {0, 1} giving y, i.e. y = x ® r where © represents modulo 2 addition. We 
need at least as many random bits as there are bits in the message, but as long as 
Bob knows every one of these random bits he can recover x by adding (modulo 
2) r to y as y®r = x®r®r = X. Shannon showed that this makes the ciphertext 
secure if an eavesdropper cannot obtain all values of r |Shannonl949| . 

How does Alice share the key consisting of the values of r to Bob? Since their 
goal was to communicate securely in the first place, they must find a secure way so 
that each party can communicate the random key. In 1984, BB showed that the 
combination of publicly communicating quantum states |^) from Alice to Bob 
and publicly communicating classical information about these states between 
Alice and Bob, secure values of r can be generated |BB1984] . An eavesdropper 
cannot perfectly copy the state that is publicly communicated by the no-cloning 
theorem, so must make a measurement to learn \ip). The security is partly based 
on the fact that when an eavesdropper makes a measurement on the quantum 
state that is sent from Alice to Bob, they project the state into another state 
which may be different from \ip). If the eavesdropper projects into a different 
state, Alice and Bob can compare measurement outcomes on the state to detect 
this. If Alice and Bob proceed with a particular protocol, with public quantum 
and classical communication, they can generate a secure random key. We then 
describe this as a method of "quantum key distribution" (QKD). 

In 1991, Artur Ekert developed another method of QKD that utilised entangle- 
ment |Ekertl99T] . This result alongside the discovery of "quantum teleportation" 
[Bennettl993j based upon sharing entanglement and classical communication led, 
in earnest, to entanglement being investigated as a resource for quantum infor- 
mation processing. Ekert based his protocol on a modified version of the CHSH 
Bell inequality test where Alice and Bob each receive one-half of the bipartite 
entangled state |^')epr- The intuition behind the protocol is that a key is re- 
vealed by the act of space-like separated measurements on this entangled state; 
if a key existed before measurement it would be an "element of reality" and so 
incompatible with an entangled state. 

For a given choice of measurements as discussed in the EPR paradox, outcomes 
are perfectly correlated generating a shared random bit. Alice and Bob randomly 
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choose measurements and announce the choice after receiving measurement out- 
comes. An eavesdropper can intercept the quantum state before it reaches either 
Ahce or Bob and make a measurement, but this interception leaves the state in a 
separable state. They use the CHSH inequality to confirm that the state is entan- 
gled when they make measurements on it. Therefore, the protocol requires that 
the state is entangled and the CHSH inequality just confirms this, the security 
of the original 1991 protocol does not hinge directly on the incompatibility with 
local realism. Remarkably, in the spirit of Ekert's intuition, Barrett, Hardy and 
Kent designed a protocol whereby security was guaranteed by a Bell inequality 
violation |Barrett2005aj . Acm et al then made the connection to the original 
CHSH inequality that Ekert used (without assuming the quantum state shared), 
to confirm the security of a key jAcm200"7] . 



1.3.2 Quantum Computing 

If one does not use quantum cryptographic means to establish secure commu- 
nication, then what means are there to establish a secure key? One of most 
commonly used tools is the Ri vest- Shamir- Adleman (RSA) algorithm which is 
based upon a computational premise |RSA1978] . It is believed that it is hard 
for computers to find the prime factors of a large number. The RSA algorithm 
involves a public and a private key, where Alice makes the product of two large 
primes public and keeps these factors private. Bob receives the public key, en- 
codes his message using it and sends his ciphertext to Alice in such a way that 
it can only be decrypted using Alice's private data. Therefore, if one can find 
the two prime factors of the public key efficiently, one can decode the message. 
However, as we mentioned, it is believed that this cannot be done efficiently with 
current computers and Alice receives the information from Bob securely. The 
RSA algorithm, as a result, is used quite successfully in many internet-based 
financial transactions. 

Remarkably, if one could build a computer that works on quantum mechan- 



ics, a quantum computei one could find the prime factors of a large number 
efficiently, thus breaking the RSA algorithm. The algorithm for finding these 
prime factors was invented by Peter Shor in 1994 |Shorl99T] and became a key 
motivator for building a quantum computer. 

In 1985, David Deutsch described a universal quantum computer which can 



^^Current desktop PCs rely on quantum theory to describe their workings. A quantum com- 
puter full exploits the quantum formalism and is based on the postulates of the theory. 
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perform any possible quantum computation [Deutschl985j . A computation can 
be described in the "circuit model" of quantum computation where a quantum 
state consisting of n qubits is prepared in the product state |0) = ^^=i |0) 
|Nielsen2000| . A computation then consists of a sequence of unitary operations 
performed on these qubits. Each unitary is considered 1 computational step, or 
"gate". After the requisite number of unitary operators is performed, some, or 
all of the n qubits can be measured. 

Various algorithms have been designed for quantum computers indicating a po- 
tential improvement in computational time over classical computers |Deutschl9921 
IShorlOQTl IGroverl996t lHarrow2009j . This improvement is conjectured in compu- 
tational complexity terms as we currently do not even know the power of classical 
computers |Papadimitrioul994| . If quantum computers are more powerful than 
classical computers, then it would be of interest to know what aspect of quantum 
mechanics gives this improvement. It might even be the case that this property 
of quantum mechanics can assert the assumed separation between quantum and 
classical computers. Jozsa and Linden showed that in quantum computations on 
pure states, unbounded entanglement is necessary if there is to be a computa- 
tional speed-up |Jozsa2003] . This does not mean that if there is entanglement in 
pure state quantum computation, the circuit cannot be simulated efficiently on a 
classical computer. A "Clifford circuit" is an example of a such a circuit that can 
be simulated efficiently with a classical computer |Nielsen2000[ IAaronson2004| . 
It has also been shown by Vidal that if entanglement is bounded, then the quan- 
tum computation can be simulated efficiently classically |Vidal2003] . For quan- 
tum computations on mixed states, which will be those that are performed in 
the laboratory, the role of entanglement is unknown or possibly not even relevant 
|Jozsa2003l IDatta2005j . 



1.3.3 Measurement-based Quantum Computing 

There are several models of quantum computing that are equivalent to the circuit 



model of quantum computing |Raussendorf200 ll IZanardil9991 IAharonov2004t 
IKitaev20031 |Leung2dOT . In one particular class of models, the presence of en- 
tanglement is by construction a key ingredient in performing a computation. 
This is the class of models of Measurement-based Quantum Computing (MBQC) 
|Raussendorf200H IRaussendorf2003( IJozsa2003j . One of the origins of this model 



^^Equivalence means that every computation in one model can be efficiently simulated in an- 
other model. 
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can be seen in the teleportation-based quantum gate model developed by Gottes- 
man and Chuang |Gottesmanl999| INielsen20031 |Leung200T] . In teleportation- 
based quantum computing, n parties share bipartite maximally entangled states 
with their nearest neighbours, and make measurements at each site. Richard 
Jozsa has shown that this model is equivalent to a model proposed by Raussendorf 
and Briegel (RB) in 2001 |Jozsa2003l rRaussendorf200T] . 

The model of MBQC proposed by RB consists of a multipartite entangled 
state, or "resource state" shared by n parties |Raussendorf200l] . This state is 
the "cluster state" consisting of n qubits [Raussendorf200 1] IRaussendorf2003] . 
However, in our discussion, we allow any possible state to be shared by these 
parties (see section 3.4) and do not restrict this aspect of MBQC. Also, from 
now on when we mention MBQC, we make it synonymous with the original RB 
construction but with any possible resource state |Anders2009j . The computation 
proceeds by each site performing a measurement with two outcomes ±1 on their 
respective system. Measurements on, say, cluster states can have completely 
random outcomes, but a set of gates in the quantum circuit model corresponds 
to a set of unitary operators |Raussendorf200T] . The unitary evolution of a 
state is a deterministic operation. Remarkably, one can achieve determinism 
in MBQC by applying corrections at the end of the measurements and making 
measurements adaptive | Jozsa2003] . That is, the choice of measurement during 
the computation must be dependent on previous measurement outcomes. 

To take into account this correction and adaptivity, a crucial component of 
MBQC is needed: the "classical control computer" |Raussendorf2003] Briegel2009 



IAnders2009| . This computer is a classical processor and processes bits corre- 
sponding to a choice of measurement and its respective outcome at each site. 
In the model of RB, one just needs a choice between two measurements at each 
site to get a universal quantum computer labelled by bit-values. The outcomes 
of a measurement are ±1 = (—1)^ as described above where x is now a bit- 
value. In MBQC as formulated by RB, the control computer does not require 
all possible operations, or gates in classical computing. In fact, as pointed out 
by Anders and Browne, all that is needed is modulo 2 addition between classical 
data |Anders2009| . Using only these operations, a computer cannot perform all 
logical, or Boolean operations, and is therefore not functionally complete for all 
classical computations. 



31 



1.3.4 Entanglement as a Resource 

In MBQC, entanglement can be seen as a resource that is "consumed" via single- 
qubit measurements |Briegel2009| . This model also provides a nice distinction 
between the quantum and classical parts of computation; the quantum part be- 
ing the measurements on a quantum state and the classical control computer 
providing some, albeit limited processing to utilise this measurement data. 

The idea of entanglement being a resource for information processing that is 
consumed can be seen in many aspects of quantum information |Horodecki2009j . 
Historically, beginning with entanglement as a resource for producing secure 
keys, then used as a channel for communicating quantum states via teleporta- 
tion |Ekertl99H IBennettl993j . The interplay between quantum gates and tele- 
portation as highlighted by Gottesman and Chuang, also highlights the role of 
entanglement with respect to computation |Gottesmanl999| IJozsa2003] . Also 
relevant to quantum computation and communication, entanglement has been 
utilised as a resource for correcting errors |Brun2006] . 

Inspired by these information processing tasks, the resource theory of entan- 
glement has been developed |IIorodecki2009j . If parties are restricted to being 
only able to perform local operations on their respective subsystem and commu- 
nicating classical information (LOCC), then they cannot produce an entangled 
quantum state |Horodecki2009] . Therefore, if parties have an entangled state, 
they can do tasks that they otherwise could not do with only LOCC. This the- 
ory has become well-developed and we refer the reader to [Horodecki2009| for a 
review of entanglement in quantum information. 

1.4 Bell Inequalities and Quantum Information 

Since entanglement is a resource for information processing, and entanglement 
was used to show an incompatibility of quantum physics with local realism, can 
this incompatibility also be used as a resource? In recent years, the answer 
to this question has been answered in the affirmative. The intuition behind 
Ekert's 1991 QKD protocol that if a key is some element of reality held by each 
party then an eavesdropper can threaten security and learn this data |Ekertl99T] . 
As mentioned, Barrett, Hardy and Kent developed this intuition |Barrett2005aj 
and then Acm et al made the connection between security and Bell's theorem 
concrete |Acm200"7] . They showed that if we put no constraint on the devices 
that Alice and Bob use (these devices can even be produced by the eavesdropper) , 
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then the security of a key can be estabhshed directly by the violation of a Beh 
inequahty. This is an example of "device-independent" quantum information 



field very briefly. 

1.4.1 Device-independent Quantum Information 

A violation of a Bell inequality indicates that if we assume our system is quan- 
tum mechanical, then the shared quantum state was entangled. Therefore, it is 
natural to say that a violation must detect entanglement without making any 
assumption on the system. Indeed this idea has been developed both in the bi- 
partite and multipartite setting where a Bell inequality is used as a "witness" of 
entanglement |Liang20TH IRabelo2011| . In calculating entanglement of a state 
directly, one calculates this quantity directly from the state. However, if we do 
not know the state and we observe a violation, then it must be entanglecj^ 




There are two aspects of bipartite entanglement that make it useful for QKD. 
The fact that random, yet completely correlated outcomes can be generated for 
the shared key, and if measured, the system will no longer be entangled. The 
second fact ensures that the randomly generated key is securely generated. But 
randomness is in of itself a useful resource for many tasks |Knuthl981| , including 
secure key distribution and cryptography in general |Shannonl949| . For example, 
in a Monte Carlo simulation of complicated systems, a random source is required 
to pick a data point at random on which to calculate something [Metropolis 1949| . 
Also, randomly sampling from a probability distribution to perform statistical 
analysis is useful for ruling out statistical bias in this analysis. 

Genuinely random processes are difficult to come by as classical physical sys- 
tems are seemingly random due to lack of knowledge about all parameters of the 
systems. The underlying parameters of the system have deterministic properties 
but our inability to access all of them leads to the assignment of probabilities. 
This form of randomness can be seen as not true randomness due to the underly- 
ing determinism, but "pseudorandomness" |Knuthl981| . However, if we assume 
that locality must be respected then the random outcomes of observables on ei- 
ther side of a bipartite, space-like separated maximally entangled state cannot 
be due to some underlying real parameters. The randomness of the maximally 
entangled state is a good source of randomness. 

^^As well as this device-independent approach to entanglement, Bell inequalities can be used 
to gain information about the dimension of a quantum sytem [Gallego2010| . 



processing |Mayers98 , IAcm200"7l IPironio2010j , and we now review this nascent 
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If we do not assume that we have a maximally entangled state shared be- 
tween two parties, Pironio et al showed that true randomness can be generated 
from the violation of a Bell inequality |Pironio2010j . This randomness from 
a violation can then be used as a "seed" to generate something more random. 
Therefore, randomness can be generated without assuming anything about the 
underlying system that can possibly generate it, and so is device- independent. 
The generation of random numbers |Pironio2010l IColbeck2007] and cryptography 
[Mayers 98 IAcm200"7l IPironio20091 ISilman2011| are two main current implemen- 



tations of device-independent protocols. The motivation behind both of these 
tasks comes from cryptography, but in the next section we give an example of 
computing based on a violation of a Bell inequality in the form of the GHZ 
paradox. 

1.4.2 GHZ Paradox and Measurement-based Quantum Computing 

Models of computing have been related to Bell inequalities. Communication 
complexity is a model where we have several parties and each party has un- 
bounded computational power |Kushilevitzl996j . Each party has some data 
and the goal is to compute some function on all of this data. The question 
is whether all of this data needs to be sent between parties in order for the func- 
tion to be computed? Communication complexity studies the minimum amount 
of communication needed to calculate a particular function. If a system violates 
a particular Bell inequality, then it can exhibit an advantage in a communi- 
cation complexity task over a system that does not violate a Bell inequality 
|Brukner2004l IBuhrman2010| . Another example of a computational model re- 
lated to Bell inequalities is a "non-local game" |Cleve2004] . We will discuss these 
models in sub-section 3.3.2 of chapter |3] and so postpone discussion of the model 



until then. 

In MBQC, the classical control computer can only perform addition modulo 
2, or "XOR gates" as they are called in the Boolean circuit model of classical 
computing |Anders2009j . In order to have a full power classical computer, we 
require another gate: the "NAND gate" |Papadimitrioul994| . The XOR gate on 
two bits xi and X2 is the function f{xi,X2) = xi © X2 but the NAND gate is 
/(xi,X2) = 1 © xiX2 where this function is for xi = X2 = 1 and 1 otherwise. 
Anders and Browne (AB) showed that in MBQC a NAND gate can be performed 
with three measurement sites and a single round of measurements |Anders2009| . 
This three-party system is also the minimal resource in MBQC that can produce 
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this function. AB used the GHZ paradox to demonstrate this result and we now 
review this result [Anders2009j . 

If we inspect the correlations in (1.21), (1.22), ( |1.23| ) and (1.26), the choice of 
measurements can be labeled by bit-values sj E {0, 1} at each jth site. We relabel 
the values si and S2 to be some bit-values xi and X2 respectively. Therefore, for 



the specific correlations in (1.21), (1.22), (1.23) and (1.26), the choice of third 



measurement S3 is equal to xi (B X2- This can be modeled as a computation in 
MBQC where the classical control computer sets the choice of measurement on 
the first two sites to be xi and X2, and xi © X2 for the third site; the classical 
control computer calculates this third choice. Then if the measurements and 
quantum state are those in the GHZ paradox then we observe the correlations 
are 

EiMl'M^'M^^'®''^) = (-If 1^2®!. (1.28) 



The function corresponding to the NAND gate then appears on the right-hand- 
side. We obtain the measurement outcome from the j'th site as (— 1)*"^ where 
rrij £ {0,1} and then the joint outcome of all three parties is ( — l)™-i®™-2ffim3^ 
Then every instance of A^^^ A^2^Al3^®^^ must deterministically produce an out- 
come (— i)™'i©™2eni3 gq^g^i ^_i'^xiX2®i _ jf gg^g]^ gj|-g ggnds the value rrij to the 

classical control computer then it can calculate mi © 1712 © thus obtaining 
X1X2 © 1 deterministically. 

We have shown that classical correlations cannot reproduce the above quantum 
correlations. In order that we produce a NAND gate with classical correlations 
communication in the form of adaptivity is required in the measurement-based 
circuit. This does not minimise the resources required for a full classical computer 
and shows that correlations that are incompatible with local realism are useful 
in MBQC. These ideas will be developed further in section 3.4 of chapter l3j 



1.5 Chapter Summary 

We have introduced the quantum formalism and shown that it has an interesting 
mathematical consequence: entanglement. Not only is entanglement a mathe- 
matical curiosity, it has consequences for our understanding of quantum theory. 
In particular, it challenges the notion of local realism that is satisfied in clas- 
sical physical systems |Bell2004j . The advent of quantum information placed 
entanglement in yet another context, that as an information theoretic resource 
|Horodecki2009] . It then has become an interesting research avenue to link the in- 
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compatibility with local realism with a potential information theoretic advantage. 
This has led to the development of device-independent quantum information. 

Finally we have an indication of the possible applications of Bell inequalities 
to some computational models. When implementing a test of a Bell inequality, 
or Bell test, measurements must be space-like separated and classical commu- 
nication is ruled out. Computation often involves time ordering of operations, 
or gates, and this time-ordering allows the possibility of communication between 
parties. However, when we process statistics from Bell tests, we are perform- 
ing a computation on this data, and in the example of the GHZ paradox we 
can use this processing to obtain something "useful" from quantum correlations 
|Anders2009] . Correlat ions then are computations in this example after this pro- 
cessing. Throughout this thesis, this picture will become central to our under- 
standing of correlations. That is, correlations can be used to compute particular 
functions on an "input" corresponding to the choice of measurement settings at 
all sites. 

This computational insight on Bell tests will be used to give a new perspective 
on established ideas in Bell tests as well as new ideas for Bell tests. We will re- 
view the issue of loopholes in Bell tests |Pearlel970j and indicate that they have 
a computational interpretation that makes this subject more amenable pedagog- 
ically. Motivated by these issues, we describe a way of expanding Bell tests to 
include processing of statistical data but without introducing loopholes. We give 
the notion of a loophole a more technical grounding and present these results in 
chapter |4j 

We have hinted at a connection between Bell tests and MBQC. We extend 
this connection and make it more concrete in chapter [3] by discussing MBQC 
without adaptivity. Using Bell tests we can actually say something about the 
power of MBQC without adaptivity as well as showing that quantum physics can 
do something that classical physics cannot. In chapter [4] we will discuss whether 
MBQC with adaptivity can be framed in terms of a Bell test. We give some 
indication that this is possible using a method of loophole-free data processing. 
Before we discuss applications of the Bell test to computation and vice versa, in 
the next chapter we introduce our framework for Bell tests. 
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2 Correlators and Bell Tests 



In this chapter, we wih lay the foundations for our study of Bell tests |Belll964] . 
More precisely we will motivate the study of what we call correlators: the statis- 
tics of the joint outcome of many parties. We have already discussed (in section 



1.2) correlators in terms of the expectation value of measurements made in the 



Bell-CHSH test |CHSH1969j . We now describe correlators in terms of conditional 
probabilities of a joint outcome given some measurement settings. We will make 
the connection to the Bell-CHSH test concrete and show that considering these 
conditional probabilities allows for greater scope when considering a broad class 
of Bell tests. 

One prominent tool utilised in this chapter is to describe the correlators, or 
conditional probabilities as stochastic maps from a set of inputs (describing the 
measurement settings) to a set of outputs (describing corresponding measure- 
ment outcomes). These maps are then probabilistic maps from an input to an 
output. We will define particular classes of functions and show how they relate 
to correlators resulting from particular physical theories, more specifically locally 
realistic and quantum theories. 

The famous Bell inequality emerges from a discussion on the geometry of 
stochastic maps. Correlators can be represented as vectors in a real vector space; 
every vector is a list of conditional probabilities for each joint outcome for every 
choice of measurement settings. In this real space, the space of LHV correlators 
can be defined as a convex polytope, an object which is the convex hull of a finite 
number of correlators (called extreme points) | Griinbaum2003| . The boundary, 
or surface of a convex polytope is made up of objects called faces. If the dimen- 
sion of the space a polytope lives in is A, then a (A — l)-dimensional face is called 
a facet and facets are defined by particular linear inequalities. These inequalities 
define half-spaces in the A-dimensional real space, and the intersection of these 
half-spaces also define a convex polytope | Grunbaum2003| . For the convex poly- 
tope of locally realistic correlators, the linear inequalities that define its facets are 
the facet Bell inequalities |Pitowskyl989l li^YoissartlMTl IPeresl999l IFinel982] . 
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We combine this geometric picture of correlators with the discussion of stochas- 
tic maps or functions and show that we can describe Beh tests in terms of com- 
putations. This computational aspect allows us to capture locally realistic cor- 
relators in terms of computational expressiveness. Not only is this method used 
to describe correlators, it is used to say something about the full probability of 
distribution for all possible measurement outcomes and settings. In particular, 
correlators single out particular probability distributions that only satisfy spe- 
cial relativity (non-signalling) and no other physical constraints |Popescul994| . 
Finally, we also characterise correlators, and correlations in general that appear 
in a model constructed originally by George Svetlichny jSvetlichnyl987| . This 
model allows a sub-set of parties to share unconstrained correlations but satisfy 
local realism with respect to others. 

This chapter in the main motivates the study of correlators as a simplification 
from studying the full statistics of a Bell test. Despite the simplification, the 
study of correlators yields significant insights into the study of the full proba- 
bility distribution. We also establish the framework upon which results in later 



chapters are built. Section 2.1 consists of review material, and the work in 



section 2.2 introduces a new computational framework for correlators. Section 



2.3 consists of new results describing non-signalling correlations and section 2.4 



recasts Svetlichny correlations in terms of a computational description. The orig- 



inal work in sections 2.2, 2.3 and |2.4| was completed in collaboration with Joel 



Wallman and Dan Browne and published as |Hoban2011c] . 

2.1 A General Framework for Bell tests 

Bell tests are carried out by space-like separated parties that each make a choice 
from a set of measurements and each measurement produces an outcome from 
a set of possible outcomes |Belll964] . From this starting point, it has been in- 
sightful to think of Bell tests, and other physical processes from an operational 
point-of-view |Hardy2001 Hardy2011, IBarrett2007j . In an operational frame- 



work, each measurement site is an abstract object, often referred to as a "box", 
that takes an "input" as the choice of measurement setting and returns an "out- 
put" in the form of a measurement outcome. Operationally then, we only concern 
ourselves with the statistics resulting from these boxes and not necessarily their 
"inner- workings". We only want to infer the properties of these boxes from their 
statistics making minimal assumptions. 
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sj e {0, 1, (c - 1)} m, e {0, 1, (d - 1)} 



Figure 2.1: In a Bell test, n parties each make a measurement from c possible 
choices, where each measurement has d possible outcomes. Labelling 
the jth party's measurement choice and outcome by sj and rrij re- 
spectively, we can describe each run of the experiment with n-digit 
strings m and s. (Copyright: American Physical Society, 2011). 



We then consider n space-like separated parties, or boxes. Each jth site for 
j G {1, 2, n} makes a measurement from a choice of Cj measurements 
where Sj G {0, 1, (cj — 1)} labels the choice of measurement and is expressed in 
terms of an integer, or digit in , the cyclic group of Cj elements. Each measure- 
ment Ais- has d{sj) possible outcomes Om(sj)5 where m{sj) € {0, 1, (d(sj) — 1)} 
is an element of the cyclic group of d{sj) elements. Therefore, in opera- 

tional terms, each jth box takes an input Sj and returns an output m{sj) for each 
input. From now on, we assume that d{sj) = dj is constant for all measurements 



labelled by Sj. We include a schematic of the Bell test in Figure 2.1 



Inputs into all n boxes are represented by n-length digit-strings s G ©j=i , 
the Cartesian product of all sites' inputs s = {si, S2, Sn}^ All digit-strings will 
be expressed in bold typeface, with jth elements not in bold, but with sub-script 
j. Outputs are then expressed as re-length digit-strings m = {mi,m2, ■■■,mn} 
where we have not explicitly written the dependence on s for brevity, but this 



^We are using this perhaps unconventional notation for the Cartesian product for the sake of 
brevity. Typically the Cartesian product between sets A and B is represented as ^4 x B 
and Ai x ... x A„ for an n-fold Cartesian product of sets Aj. In this non-standard notation 
Ai X ... X An = Aj . 
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dependence is implied. That is, every output is always a particular output m 
for a given input s. Given non-empty sub-sets J C {1, 2, n} of all n with \J\ 
being the number of parties in the sub-set, the outputs of this sub-set is written 
as m'f-'l-'^^J' = {mj\j G J7} such that singleton sub-sets are the elements rrij. The 
same notation is applied also to the inputs s with s^-'l-'^'^J' = {sj\j € J} being 
the inputs on a sub-set J. When J includes all n parties then we recover m as 
the output again. There are then (2" — 1) of these non-empty sub-sets J. 

Central to the standard construction of Bell tests is that choice of measurement 
setting is independent of anything else in the experiment |Belll977] . In other 
words, the choice of measurement is completely random, i.e. p(s) = rf"_i — . 
The consequences of relaxing the constraint of measurement independence have 
been shown to be detrimental to Bell tests |Barrett201H IHall201l] . 

As mentioned, in Bell tests, statistics are calculated from the data obtained 
from the boxes. The statistics are the conditional probabilities p{m^^\^^'^^\s), the 
probability of obtaining outputs m^-^'-^^'^^ given the input s for all sub-sets J of 
n parties. Crucially though we can obtain every probability p{m^^\^'^'^^\s) for a 
proper sub-set J7 from the full distribution p(m|s) by taking a sum of outcomes 
on the complement sub-set J''^ = {j\{l,2, ....,n} \ J} to J of all n parties, i.e. 
p(^m{ili6^}|s) = Ylj£j<: p(m|s). Therefore from now on we only need to consider 
the full probability distribution p(m|s). In the following sub-section, we discuss 
the basic geometric objects that will dominate our discussion of correlations: the 
convex polytope. 

2.1.1 Convex Polytopes and Stochastic Maps 

The conditional probabilities p(m|s) are stochastic maps producing the map, 
or function / : ©"=i — ^ ® j=i '^<ij '^^^^ some probability. Throughout this 
thesis, we will use a geometric picture to consider these (and other forms of) 
stochastic maps. These conditional probabilities p(m|s) are elements of a vec- 
tor p in a real vector space. We can reduce the number of probabilities we 
need to consider by the normalisation condition that X^mP(m|s) = 1 where 
p{0\s) = 1 - Em7^oP(m|s)- Vectors p have length D = (11^=1 - '^)IYj=iCj 
in real space Since the elements of p are probabilities, they will live 
in a bounded sub-space in M'^ satisfying the constraints that all p(m|s) > 
and X]m^o^'('^l^) — ^' positivity and normalisation constraints respectively. 

^This can be seen from the fact that we have rij=i normalisation conditions (one for each 
input string) and 117=1 '^j 117=1 Cj original probabilities p(m|s). 



40 



These inequalities essentially describe a convex polytope in M which we call J-. 
Convex polytopes will be a central part of this thesis for all manner of different 
real spaces and so we shall define them for all possible real spaces now. 

A convex polytope C in a real space of dimension A can be defined in 
two ways: first is the half-space representation and the second is the vertex 
representation |Grunbaum2003| . We will now formally define C in terms of each 
representation: 

Definition 1. (Half-space representation): A convex polytope C in a real space 
M'^ of dimension A is the intersection of closed half-spaces. These closed half- 
spaces are defined by linear inequalities of the form ajvj < b for real values 
Oj and elements Vj of a vector v G M^. 

This definition is general enough to encompass unbounded polytopes. We say 
a convex polytope is bounded if it can be contained in a ball of finite radius and 
unbounded otherwise. We impose the extra constraints that there are a finite 
number of inequalities that form a bounded polytope |Grunbaum2003 | . 

The linear inequalities in the above definition are "facet-defining" which we 
define formally later on but can be informally seen as the boundary of the con- 
vex polytope C. If we return to the example of J- as the space of all possible 
conditional probabilities p(ni|s) then the linear inequalities defining are the 
positivity and normalisation constraints. As mentioned, dual to the half-space 
representation, the vertex representation of a convex polytope describes the poly- 
tope in terms of all points, or vectors in the polytope: 

Definition 2. (Vertex representation): convex polytope C in a real space 
of dimension A is the convex hull of E extreme points, or vectors Ve E for 
ee{l,2,...,E}. 

The convex polytope C then is the set of vectors that can be written as a 
convex combination of E vectors in the A-dimensional real space. For example, 
the polytope can then be written in terms of the convex combination of E 
vectors p which we call pe for e G {1, 2, ...,E} with probability distribution p{E) 
over each pe. These vectors have the elements pe that are the deterministic 
probabilities p(m|s) G {0, 1}. For these deterministic probabilities we associate 
values {0, 1} with each map / : s — t- m. This way the probabilities can be written 

-* f(s) 

as p(m|s) = YlEPi-^)'PE = J2fPf^m where /(s) is the image of s under / and 
Pf is a probability distribution over all maps /. 
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2.1.2 The Non-signalling Polytope 

One can make extra assumptions upon the statistical data obtained from space- 
like separated sites: each measurement site cannot communicate with each other 
outside each other's light-cone. This assumption is expressed in terms of the 
no-signalling condition which can be formally stated as: 

p{m\s)= Mm|s') =i3(m^^'l^'^^'^|s^^'l^'^^'>), (2.1) 

where J' is any sub-set of all n parties and J''^ = {j G {1, 2, n}|j ^ J'} is the 
complement of this sub-set and s 7^ s' such that the inputs differ in elements sj 
with j & J |Popescul994| . Each of these conditions forms a hyperplane in MP 
and then the intersection of hyperplanes is the space of correlations that satisfies 
the no-signalling condition. Or just as before, one can reduce the dimensionality 
of the space of statistics by imposing these equalities and then define inequalities 
on the reduced space. 

Therefore one can construct another convex polytope called MS which is the 
intersection of half-spaces defined by inequalities resulting from the normalisa- 
tion, positivity and no-signalling conditions [Barrett2005bj . We shall discuss the 



polytope MS in section 2.3 of this chapter. Now we consider the correlations 



that satisfy local realism, or local hidden variable theories. 

2.1.3 Local Hidden Variable Theories and Bell Inequalities 

A Bell test is an experiment that aims to test whether the statistics produced by 
boxes can be satisfied by a theory that obeys local realism. Systems that satisfy 
local realism satisfy two conditions (covered thoroughly in |Bell2004] ) : 

1. Realism: There are objective properties of a system that are elements, or 
"hidden" variables A G A in a (generally continuously defined) space of 
hidden variables A. These variables have a pre-existing value before the 
measurement is made and can influence measurement outcomes; 

2. Locality: The variables A possessed by a party at any site are not affected 
by events that occur outside of the light-cone of the measurement made at 
this site. These variables are called Local Hidden Variables (LHV). 

Each party's measurement is influenced by A and the measurement choice made 
at that party's site. In an LHV theory, space-like separated parties cannot com- 
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municate their measurement information to each other via the LHV, or any other 
means, due to locahty. Measurement outcomes are then influenced by sj and A 
alone. 

To be more precise, there is a probability distribution p{X)dX over A such that 
p{X) > and J^p{X)dX = 1. This can occur, for example, if the parties have 
some shared source of randomness over the variables A. Therefore each set of 
measurement outcomes conditioned upon measurement settings can be written 
in the following form |Belll964] . 

p(m|s) = / p{X)dXY\pimj\sj,X). (2.2 



/ pWdXYl 



This expression can be written in terms of a convex combination of deterministic 
maps gj : Zc^. — )• Z^^ at each site. The single site probabilities are then written as 
a convex combination over all deterministic maps, i.e. p{mj\sj, X) = Ylg Pgj^T/ 
where gj are the single site maps with pg. > and J^g^ Pgj = 1 • If one considers 
all n deterministic single-site maps, then we can deterministically obtain the re- 
sulting output digit-string from all parties m = {gi{si), g2is2), gsiss), 
where gj{sj) is the image of sj under the single-site map gj. As a result, equation 



(2.2) can be rewritten as: 



p(m|s)= ^'9i.92,...,9nnCL)' (2-3) 



91,92, ■■■,gn j 



taking a convex combination over all combination of single site maps gj so that 
P9i,92,-,9n ^ X^^^ P9i,92,-,9n = 1 is satisfied. Note that the decompo- 

sition in (2.3) is not unique; uniqueness is only guaranteed when Pg]_,g2,...,g„ = 1 



for a particular choice of single site maps. 



We see immediately from (2.3) that the space £jr C of LHV correlations 



p(m|s) is also a convex polytope as defined in terms of a vertex representation. 
The vertices of Cjr are the deterministic probabilities p(m|s) = Ff?'^'"^^ ^ cor- 

'■J 9j (S) 

responding to each combination of single site maps gj. There is also the facet 
representation of the polytope Cjr in terms of facet-defining linear inequalities. 
These linear inequalities are the facet-defining Bell inequalities, which we abbre- 
viate to facet Bell inequalities, that constrain and define the consequences 
of LHV theories |Conins2004l IFroissartlOSTl |Pitowskyl989l IPeresl999j . We now 



formally define what is means for a linear inequality to be facet-defining. 
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Definition 3. A linear inequality is facet-defining for a convex polytope C in a 
real space of dimension A when at least A affinely independent extreme points 
of C saturate the inequality (i.e. satisfy the equality of the linear inequality). 

A set S oi K vectors pi, S = {poiPi) ■■■■,P{K-i)} is affinely independent if for 
every p^ G 5, the {K — 1) vectors in the set {pi — Pk\Pi 7^ Pk} are linearly 
independent. A linear inequality for the space of correlations is of the form: 

J^/3„,,sp(m|s) <7£, (2.4) 

m,s 

where /3m,s are real pre-factors depending on m and s and 7£ G M as the upper 



bound resulting from LHV correlations in (2.3). All LHV correlations satisfy 



(2.4) whether the inequality is facet-defining or otherwise. For the inequalities 



to be facet Bell Inequalities the following conditions must be satisfied: 



for at least (11^=1 11^=1 affinely independent vectors p such that elements 

are p(m|s) = Hj '^™(s)- ^an demonstrate this schematically in Figure 2.2 
where we show that a facet Bell inequality picks out the surface of the L 



HV 



polytope, whereas the inequalities in (2.4) might just bound the LHV polytope. 



We will make these ideas concrete in chapter [3} 

The problem of finding the facets of a polytope given the vertices is known 
as the facet enumeration problem |Collins2004j and software does exist that per- 
forms this task (e.g. |Polymake2000| ) . However, it is currently in general both 
theoretically and practically hard to find these inequalities as we shall discuss 
in the subsequent sub-section. The hardness of this problem will motivate us to 
think about simplified Bell inequality settings, and then relate these simplified 
settings to a more general setting. 

2.1.4 Facet Bell Inequalities and Computational Complexity 

Given our abstract setting for n parties each with Cj possible inputs and dj pos- 
sible outputs, it is immediately natural to ask how hard is it to obtain Facet 
Bell Inequalities? Pitowsky notably studied this question by studying the in- 
timate link between convex polytopes and propositional logic Pitowskyl989| . 



The latter then has a deep connection to computational complexity, the branch 
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Figure 2.2: The LHV polytope can be defined in terms of the facet Bell inequali- 
ties. These inequalities intersect the surface of this polytope; whereas 
an arbitrary Bell inequality might only intersect one vertex as shown 
in this schematic. 



45 



of theoretical computer science devoted to the hardness of computational prob- 
lems |Papadimitrioul994| . 

Whether a problem is easy or not can be defined in terms of time efficiency of 
finding a solution on a Turing Machine (an abstract computer that can simulate 
all other computers |Turingl937| ) with respect to the size of the input. Time is 
defined in terms of computational time, or the number of computational steps 
in an algorithm. The computational time as a function of input size is then the 
indicator of computational efficiency, or hardness. If this function is a polynomial 
in the size of the input, then it is efficient, if super-polynomial (e.g. exponential) 
then it is inefficient |Papadimitrioul994| . 

The problems that are given to a Turing Machine are decision problems. That 
is, given an input the machine outputs 'yes', represented as the bit or 'no', 
represented as bit 1; the decision problems are questions with binary potential 
outcomes. If the algorithm performed by the machine to make this decision 
operates in a number of steps that is polynomial in the size of the input, then it 
is in the complexity class called P. It is key to note that the algorithm must be 
polynomial in the input size for all possible inputs, as there may be inputs that 
are easier to compute than others. If problems in P are efficiently solvable, then 
there is another class of problems where solutions can be checked (for veracity) 
in an efficient amount of time. This complexity class is called NP. 

The question of whether P is equal to NP is one of the greatest mathematical 
unsolved puzzles. Discussion of this problem is outside of the discussion of this 
thesis. However, if P = NP then in loose terms it would be as easy to solve a prob- 
lem as to check the validity of the solution; this may seem intuitively incorrect 
to both a casual reader and a computer scientist. The hardest decision problems 
in NP are called NP-complete and so if P = NP then these problems have a 
polynomial time solution. At the current time, no polynomial time solution is 
known for NP-complete problems. 

Pitowsky has shown that finding the facet Bell inequalities is at least as hard 
as any of the NP-complete problems, if not necessarily in NP |Pitowskyl989[ 
Pitowskyl991| . In the terminology of computatational complexity, this problem 



is NP-hard. Heuristically, Pitowsky showed this by relating the problem of find- 
ing a facet Bell inequality to a Boolean satisfiability problem Pitowskyl991| . 
These problems ask whether there exist variables that result in a Boolean func- 
tion being 'true' and are NP-complete |Papadimitrioul994| . The vertices of the 
LHV polytope £jr consist of vectors with elements being or 1, which are truth 
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value assignments. This relationship between vertices of the polytope and truth 
assignments allowed Pitowsky to say that finding facet Bell inequalities is at least 
as hard clS Si Boolean satisfiability problem. 

We have given an overview of the geometric construction of Bell inequalities 
and the space of correlations. The implications of local realism are connected to 
the study of convex polytopes. Polytopes have also been used to study the space 
of non-signalling correlations; we shall return to this subject in section [2?3l of this 
chapter. Finally we have shown that finding the Bell inequalities that define the 
LHV polytope is a computationally hard problem. This motivates our study of 
correlators, the probability of a joint outcome between all n parties instead of 
the full probability distribution. This simplifies a hard problem by decreasing 
its dimensionality. Correlators also have a computational perspective that will 
become crucial to our study of Bell inequalities. 



2.2 Correlators 

Finding facet Bell inequalities is difficult. This difficulty scales with the size of 
the problem such as the number of possible inputs and outputs for n parties. 
Firstly, we assume that dj = dk = d (for j ^ k) is the same for all parties and is 
prime. We also simplify the type of probabilities that we need to consider from 
the correlations p(m|s) to a correlator which takes the form: 

p{k\s) = J^Jp-^'^^^Mmls) =p([X;m,], = k\s\ (2.6) 

m j=l 

where throughout this thesis (unless otherwise stated) we write all modulo x 
arithmetic in brackets with a sub-script [..]x- From this perspective, the Bell test 
now consists of inputs s G Zc, and a single value output k = Y]^--, rrij 

J -L ■> — Ad 

is returned. 

The correlator p{k\s) is a stochastic map / : 0"=i and due to the 

normalisation X]fe=o?'(^l^) ~ ^ ^' o^^y need to consider [d— 1) 11^=1 

correlators. We do not consider, then, the correlator p(0|s) for all s as it can be 
recovered by normalisation. These correlators are now elements of a real vector 
k G M^"^"^^ nj=i which we call a correlator vector^ 



^If there is no conflict in meaning, we may shorten correlator vector to just correlator. For 
example, if we refer to correlators being in some space, this means the resulting correlator 
vectors are in some space. 
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We can describe the space of all possible correlator vectors as another convex 
polytope V. This object, analogous to has a simple description in terms of 
vertices and linear inequalities. First, V has E vertices, or extreme points, ke for 
e G {1, 2, E} that have the elements pe{k\s) e {0, 1} for all k and s. Therefore, 
these vectors correspond to deterministic maps where a single value of k is the 
output given the input s so elements are pe{k\s) = 5^^^-^ where /(s) is the image 
of s under the map / : ^d- Any correlator vector k G M^"^"^) 11^=1 '^j 

can be written as a convex combination of these extreme points: 

k = ^peke = ^Pfkf (2.7) 
e / 

where every extreme point ke corresponds one-to-one with a vector kf resulting 
from a function / : 0"=^ — ^ and Pe > 0, > with Xle^e = 1 and 
'}2fPf ~ 1 functions /. 

Equivalently we can describe V in terms of the linear inequalities corresponding 
to positivity and normalisation: p{k\s) > for all k and s and X]fc^o^'(^l^) — ^• 
This is analogous to the way we defined J- but interestingly, every vector in 
V can be produced by at least one probability distribution in AfS, the non- 
signalling polytope. If we allow all probability distributions that satisfy only 
the no-signalling condition we can completely saturate V. As an example, every 
vertex of V corresponding to the map / : ©"=i — we can always write 
this probability distribution: 

f I ^ ifE-=i"^.k = /(s), 

p(m|s) = (2.8) 

otherwise. 

for the function / as above. All reductions of this probability distribution are 
p(^rn^j\j&J}\s{j\j(^J}) = d-\^\ if \J\ / n for ah m^iH^^} and s^^\^^^\ Since this 
distribution is uniformly random for all sub-sets of parties, it satisfies the no- 
signalling condition. We shall elaborate on the connections between MS and V 




in a subsequent section 2.3 



Another motivation for these correlators is that they are a generalisation of 
the well-studied CHSH Bell Inequality setting for many parties |CHSH1969[ 



IWerner2001t IZukowski2002| . This generalisation also coincides with the Collins- 



Gisin-Linden-Massar-Popescu (CGLMP) setting again generalised to many par- 
ties |CGLMP2002j . An example of work in a many-setting CGLMP framework 
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includes that by Acm et al |Acm2004] . 

Also, in the literature, correlators can be considered to result from the expecta- 
tion value of the outcome of joint measurements if the outcomes of measurements 
are complex numbers of unit modulus |Lee2007l ISon2006] . More specifically, ev- 
ery jth party's measurement -Adsj has the outcome values e^^'^s for k G Z^, then 
the expectation value of the joint measurement E(s) = IE(nj=i -^Sj) i^: 

n 

E(s) = ^ JJe^2^^p(m|s) 

m j=l 

(d-1) 

A:=0 

= 1+ ^ [e^2-S -l]p(A:|s). (2.9) 

k=l 

These expectation values IE(s) can be written in terms of correlators. Every 
measurement that has two possible outcomes — 1} results in expectation 
values of measurements being E(s) = 1 — 2p(l|s); the expectation values are 
equivalent to a single correlator ]?(l|s). This is the many-party generalisation of 
the CHSH setting for two parties. This equivalence has allowed research in the 
past to interchangeably use expectation values as well as conditional probabilities. 

One can coarse-grain research into generalized Bell inequality setting as ei- 
ther obtaining statistics in terms of correlators (e.g. |CHSH1969| ICGLMP2002| 
IAcm2004j ') or the fuU probability distribution (e.g. |CH1969l [Collins20040 . The 
latter can be reduced to the former but much literature has been devoted to the 
study of correlators. As well as being able to infer structure of MS from V (see 
section 2.3), these correlators are at the centre of much research into Bell in- 
equalities. We will now try and formalise the structure of the space of correlator 
vectors by considering the maps performed by all possible theories. 



2.2.1 Correlators as Computations 

Throughout this thesis we argue for a computational approach to Bell inequality 
experiments by considering in what sense correlations are computing a function 
/ • ©5*=! — ^ on the inputs s. In this section we now want to introduce 
some of the tools associated with these functions so that we can be more specific 
about the computational power of correlations from physical (or non-physical) 
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theories. 

Every function / : ®"=i — ^ can be written as a list (a single column 
table) with each row representing /(s), the image of s under /. In turn this 
list is an element of the module M over the ring Z^. The module M consists 
of the abelian group Zf for D = n"=i Cj with the group multiplication being 
modulo d addition of these elements, the module also has (left or right) scalar 
multiplication Z x Z^ — )• Z^ of elements in the group. In order to satisfy M 
being a module then for all x, y in Z^, and all a, b in Z^^ then: 

1. Ix = X (existence of the identity) 

2. a(6x) = (a6)x (associativity) 

3. a(x + y) = ax + oy (distributivity over Z^) 

4. (a + b)x = ax + 6x (distributivity over Z^^), 

where we could have written the scalar multiplication in terms of left or right 
multiplication |Andersonl992j . All arithmetic is modulo d but we have sup- 
pressed the notation for clarity. 

Every / G M can be written in terms of Kronecker delta functions with ele- 
ments /(s) = 6y which is 1 for only one input s = y G ©j=i and otherwise. 
Therefore every element /(s) of any function / can be written as 

/(s)= E f{y)6^y (2.10) 

with /(y) S Z(i. The delta functions then form something analogous to the basis 
vectors for a vector space and we can replace one of the delta functions with the 
constant, all-ones function with elements /(s) = 1. The delta function we choose 
to replace is /(s) = dy with y = 0, the all-zeroes digit-string. 

For every sj £ Zc^ we can choose to represent Zc^ as a Cartesian product of 
cyclic groups of dimension being the prime factors of Cj . The set of prime factors 
of Cj are written as {^Cj,'^ cj, Cj} for '^cj as the kth prime factor and qj 
being the number of prime factors, therefore Sj = {^Sj,^ Sj, Sj} G ©^=1 ^^Cj 
with ''sj E Zfc(,.. The delta functions Sy can be written now in terms of inputs 
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s e 0^=1 (eti^^c,) and y G {®lU^^c,) giving 



n 9j 

4 = nn<^; 



i=i fc=i 

n <?j 

= nn[ 

j=ik=i 

n Q] 

= nn 

i=i fc=i 



1 



1 



z=o 



Ilk, 



(2.11) 



where the second hne is guaranteed by Fcrmat's Httle theorem. That is, the 



modular arithmetic expression is 



= 1 for ''sj ^ and 



coprime with ^Cj. The third hne above just results from the binomial theorem. 

In the instance where Cj = Ck = d being prime for all j ^ k, the delta functions 
just simplify to being a polynomial over the field as indicated by the third 
line above. For example, for d = 2 the delta functions are Boolean functions 
/ : Z2 — ?• Z2 expressed as polynomials over Z2. We will use these properties 
more explicitly in subsequent chapters of this thesis. 

We now introduce classes of functions that will be used to characterise the 
correlators resulting from particular theories. The first class of functions we now 
describe as "n-partite linear functions" have connections to LHV theories. 



Definition 4. An n-partite linear function is a function g : ® j=i "^c 
where the image of s under g can be written as 



5(s) 



(2.12) 



with gj{sj) the image of Sj under the single-site map gj : "Lcj "^d- 



These functions are not strictly linear as the single-site maps gj are not always 
linear in Sj, but for cj = d = 2, then these maps are linear. We use the nomencla- 
ture of linearity only to highlight the fact that there is addition modulo d between 
single-site maps and not multiplication. If a map cannot be expressed as an n- 
partite linear function then we say it is a non-n-partite linear function. 

Any function / : 0^=1 — > ^d can be described in terms of a sum of an n- 
partite linear function and non-n-partite linear function, i.e. /(s) = [5(s)-|-/i(s)]d 
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where g(s) and h{s) are n-partite and non-n-partite linear functions respectively. 
First we write an n-partite linear function in terms of the delta functions dy^- for 
single-site maps to obtain: 



5(s) 



j=i k=i 



(2.13) 



with a, f3j^k G Z^. The constant a emerges from taking the sum modulo d of the 
constant function aj G for each site, that replaces the delta function 5q , as 
discussed. 

For inputs s with only one non-zero clement, there is only a single delta 
function (for Sj being the non-zero clement) that describes the value of 
^(s) = [a + /3j,fc^^^]^- For the all-zeroes digit-string s = 0, then the only function 
describing ^(s) is the constant function a. We call the set of digit-strings s with 
at most one non-zero element T. 

Now we briefly consider a column list of the images of s under / for only these 
digit-strings s G T. Then for this restricted list delta functions S^^ (and constant 
a), similar to before, form a basis for any function /(s) with s G T. This is 
because they are equivalent to the delta functions Sy = S^^ TVl^j ^'^^ these 
particular input digit-strings. A basis is formed in the sense that these functions 
are linearly independent over Z^. 

In order to achieve any function /(s) we need a basis for the functions for 
all possible input strings s including s G T. We do this by supplementing the 
function g{s) above with the delta functions dy with y ^ T. Therefore, any 
function can be written as a sum of an n-partite linear function g{s) and a non- 
n-partite linear function 



a 



n Cj-1 
j=l k=l y^T 

= [g{s)+h{s)]a, 



(2.14) 



with 7y G Zrf and /i(s) = ^y^7-7y<^y as a non-n-partitc linear function by con- 
struction. We describe this form of /(s) as the decomposition of the function 
into n-partite linear and non-n-partitc linear functions. If h(s) =0 for all s, then 
/(s) is necessarily an n-partite linear function, otherwise it is necessarily a non- 
n-partite linear function. If, on the other hand, g{s) =0 we have the following 



52 



result: 

Lemma 1. If a function f : — ^ has no n-partite linear function in 
its decomposition, then f{s) = for all s £ T. 

Proof - If a function /(s) = [^(s) + h{s)]ii has no n-partite linear function part, 
i.e. g{s) = for all s but some non-zero non-n-partite linear part, i.e. h{s) ^ 
for some inputs s, then it can be written as. 



(2.15) 



Then /(s) must be zero for all s € T. □ 



We will use this lemma in the proof of Lemma[3]in section 2.3.1 and is a useful 
consequence of choosing this decomposition of functions. We shall also show in 
the following subsection that this decomposition is physically motivated and not 
just mathematically convenient. 

Another class of functions will now be introduced and shown to be useful in 



later sections 2.3 and 2.4 of this chapter. They can be seen to be a relaxation 
of the constraint of n-partite linear functions where instead of taking a sum of 
single-site maps, we take a sum of maps produced by sub-sets of all parties. In 
particular, we consider all the ways in which n parties can be partitioned into a 
non-empty sub-set J' and its complement as introduced in section |2.1[ The 
class of functions called "bipartite linear functions" are then a generalization of 
n-partite linear functions defined for these partitions. 

Definition 5. A bipartite linear function is a function / : — ^ 
where the image of s under / can be written as 



/(s)= r(s 



(2.16) 



with /I : e^.g^: 



and /2 : ©^g^c 



Zrf being functions mapping 



inputs for each partition into J and J'^ to a single output. 

These functions are equivalent to an n-partite linear function for n = 2 as the 
partition can be seen as a coarse-graining of n parties into two sub-sets, where 
each sub-set can be considered a party in its own right. Then the input string 
glib'ej'} j^Q^ single input to one 'collective' party and s^^^^'^'-^"^ the input to 
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the other collective party. Each subset's collective output is just then the sum 
modulo d of all of their outputs rrij for j £ J and j G J''^ for each respective 
subset. As a result, for a given partition, any function /(s) can be written as a 
sum of a bipartite linear function and a non-bipartite linear function. 

We have described classes of functions, and every function describes a vertex 
of V. To recapitulate, a vertex of V has the elements p(A:|s) = (^^j-^-j for every 
s. A correlator captures a computation whereby given some input, an output is 
produced with some probability. The region of V that is subsumed by a particular 
theory can then have a computational interpretation in terms of how 'close' the 
region of a particular theory gets to vertices of 7^. In the following subsection we 
will discuss the region of correlators achievable in an LHV or quantum theory. 



2.2.2 Correlators from Physical Theories 

Bell tests aim to expose statistics that do not result from a particular class of 
theories viz. LHV theories. We will now describe the space of correlators C QV 
resulting from LHV theories. This space can be defined in terms of the language 



of stochastic maps, and in particular, the functions defined in the subsection |2.2.1 
We now present the following theorem which defines C in terms of a sub-class of 
all possible functions / : ©"=1 — ^ ^d- 

Theorem 2. The space of LHV correlators C is the convex hull of deterministic 
correlators p{k\s) = 5^'^^^ for f{s) being all of the n-partite linear functions, for 
all s. 

Proof: The proof follows simply from how the probabilities p(m|s) are defined 



in (2.3) to obtain correlators: 



n 

= EP^isAs)^ (2-17) 

where ^(s) = [J2]^=i 9ji^j)]d is an n-partite linear function by definition and 
Pg(s) ^ and Ylg{s)Pg(s) — 1- Therefore all LHV correlators are contained in the 
convex hull of n-partite linear functions. □ 



The consequence of this theorem then is that correlators resulting from LHV 
theories have a limited computational expressiveness. That is, no correlator re- 
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suiting from an LHV theory can deterministically perform a non-n-partite linear 
function. This is one of the main computational perspectives that we employ in 
this thesis, and we will return to this result throughout. 

The CHSH inequality is a facet-defining Bell inequality for the LHV polytope 
L for n = 2 and c\ = = d = 1 |Finel982j . We have shown previously that 
this inequality can be violated by quantum correlators, therefore they cannot 
in general be confined to the polytope £ for all possible values of n, Cj and 
d. Tsirelson showed that there is an equivalent CHSH inequality for quantum 
correlators denoted: 

p(l|00) +p(l|01) +p(l|10) -p(l|ll) < l + \/2f«2.41, (2.18) 

whereas the upper bound for LHV correlators is 2 |Tsirelsonl980j . However, the 
vertex of V described as p(l|s) = (5^j.g^ with /(s) = \s\S2 + 1]2 gives a value of 3 
for the CHSH inequality. Therefore, there is a hierarchy of spaces of correlators 
such that £ ^ Q C "P with Q as the space of quantum correlators. 

Defining the space Q of quantum correlators (and correlations in general) is 
still a major open question but we can indicate some general properties of Q. As 
Pitowsky has previously shown, Q is convex, but not a polytope |Pitowskyl989| . 
Quantum correlators can be written in terms of the probabilities p(m|s) which 
result from measurements on a quantum state p, i.e. 

m 

n 

= E^F""''''Tr(p(g)f'^,) (2.19) 

m j=l 

where Pmj is a single-site POVM corresponding to an outcome rrij given the 
choice of measurement sj so that Ylmj -^"ij ~ ^' identity matrix. If each 
measurement site has access to a Hilbert space Tij, then the state p is in general, a 
density matrix acting over the tensor-product of these n Hilbert spaces (S)j'=i T^j- 
The dimension of each Hilbert space is arbitrary (and possibly infinite). 

Naimark's theorem indicates that any POVM is equivalent to a PVM on an an- 
cilla Hilbert space (it also applies for infinite dimensional systems) |Paulsen2003| . 
Therefore every correlator can be written in terms of a state p' and projectors QT/ 
on the Hilbert space (2)j=i where Ti'j is each jth site's enlarged Hilbert space. 
Projectors QT/ are expressed in terms of each site's orthogonal basis \mj)sj for 
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the Sj choice of basis, i.e. QT/ = \mj)sj{mj\sj such that QT/ Q7/ = QT/ ■ 



A density matrix can be constructed from a convex combination of pure states 
p' = Y^iPi\ipi){'ipi\ so that, 



p{k\s) 



[E"=i m,U 



\m 



(2.20) 



Since p{k\s) = J2i 



\m 



is itself a quantum correlator, 



Q is convex and the extreme points of Q will be defined by particular measure- 



ments on particular pure states, i.e. p' = |'i/'/)(V'/| 



Q is not a polytope with a 

2 



IS con- 



finite number of extreme points as the inner product {^i \ (S)j=i \''^j)sj 
tinuously defined over the reals for all pure states and bases \mj)s^ ■ Heuristically, 
if an extreme point of Q is outside of £ then there is a correlator that is arbi- 
trarily close to this point resulting from a pure state that may also be extreme 
(see a far more rigorous analysis in [Pitowsky 1989| ) . 

As mentioned, actually finding the extreme points for all settings is a major 
open problem in current research. However, there do exist instances where the 
extreme points can be defined, particularly with n parties where each site has 



two inputs and two outputs (see section 3.2.3). Numerical methods exist for 
finding the boundary of Q using semi-definite programming jNavascues2008j and 
optimization over measurement bases given a particular state (e.g. the maximally 
entangled state for n = 2) |Durt200H IAcm2002] . We will elaborate on this point 
further on in section |3.2| of chapter [3] 

There has recently been a different tack to defining Q; is there some physical 
principle that captures the boundary of Q? Instead of being a difficult cal- 
culation, is there is an underlying reason why the extreme points are the way 
they are? There is no definite answer to this, only indications of an answer 
(e.g. work presented in |Navascues2009j and |Oppenheim2010| ) . Interestingly, 
this approach has been extended to finding information theoretic principles that 
define extreme points. For example, if an extreme point were further from C 
then parties would be able to accumulate more information than is communi- 
cated to them |Pawlowski2009j or perform calculations with a "trivial" amount 
of communication |Brassard2006j . 

Popescu and Rohrlich began this exploration of finding what defines the quan- 
tum region |Popescul994| . They originally asked whether it was special relativity 
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that limits the region of Q but the answer to this is negative. The vertex of V 



corresponding to J?(l|s) 
fohowing distribution: 



with /(s) = \s\S2 + 1]2 can be produced by the 



p(mi,m2|si,S2) 



\ if [mi + m2]2 = {s\S2 + 1]2, 



otherwise, 



(2.21) 



which is an example of a non-signalling probability distribution, as described in 
section |2.2[ more specifically it is a form of "Popescu-Rohrlich Non-local Box" 
(PR box) |Popescul994l IBarrett2nn5b) . However, it violates the Tsirelson-CHSH 
inequality above and so cannot result from quantum theory. On the other hand, 
it shows that there is a connection between the structure of V and the non- 
signalling polytope. In fact, this PR box is the only non-signalling distribution 
that can produce the corresponding vertex oiV . In the following section we make 
this unique connection more concrete. 



2.3 Non-signalling Correlations 

In this section, we will elaborate on the connections between the polytopes V 



and MS. We have mentioned that the PR box in (2.21 ) is the only non-signalling 
correlation that can be associated with achieving a particular vertex of "P. If we 
assume our resources are non-signalling and we achieve a vertex of V associated 
with the function \s\S2 + 1]2 with only one possible probability distribution. 
This is no coincidence, but one example of an infinite number of non-signalling 



probability distributions of the form (2.8) that can be uniquely associated with 
a vertex of V . 

We suggest that a vertex of MS corresponding uniquely to a vertex of V is one 
possible way to generalise a PR box to more scenarios. We introduce another 



possible generalisation of a PR box in the next chapter in section 3.4.2 First 
we discuss the n = 2 situation and show that a vertex of V that is not in L can 
be uniquely associated with a vertex of MS. We use the results we obtained 
for these bipartite PR boxes to consider the n > 2 case. In this multipartite 
scenario, again we can uniquely associate vertices of V with MS. In all of the 
discussion in this section, we assume that d is prime. 
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2.3.1 Generalised bipartite PR boxes 

The following lemma shows that there is a uniqueness relation between vertices 
of V and a distribution in NS for n = 2, or bipartite Bell tests. This result gives 
us many new ways of immediately generalising the PR box. 

Lemma 3. For every function f : TLc^ ^ ~^ that is non-n-partite linear 
for n = 2, the only non-signalling distribution compatible with the corresponding 
vertex p{k\s) = in V is 



pimi,m2\si,S2) 



d 1 i/[mi +7712]^ = /(s), ^^^^^ 
otherwise. 



Proof: The condition p{k\s) = (5^^^^ for all s = {si,S2} implies that for every 
value of rrii in p(mi, m2|si, 52), there exists a unique value of 7712 = [/(s) — mi]^. 
This immediately implies the equality for the following conditional distributions: 

p{mi= x,m2 = [f{s) - x\d\s) = ^p{mi = x,m2 = [f{s) - x]d\s) 

= p{mi = x\s) 

= ^P{mi= x,m2 = [f{s) - x\d\s) 

mi 

= pim2 = [f{s) - x]d\s), 

(2.23) 

for all x G Z^. The non-signalling condition further implies that p{mi = x\s) is 
equal to p{mi = x\si) and 

p{mi = x\si) = p{m2 = [/(s) - x\d\s2), (2.24) 

which must be satisfied for all s and all x. We will show that repeated application 



of (2.24) for varying s allows us to prove that all non- marginal probabilities are 
equal provided that /(s) has a non-n-partite linear element. 

A function /(s) can be decomposed into a non-n-partite linear and n-partite 
linear part, i.e./(s) = [g(s)+h(s)](i with /i(s) as a non-n-partite linear function. 
For every function, the n-partite linear part g{s) = [gi{si) + g2{s2)\d can be 
removed by local operations performed by each party; gj{sj) is a single-site map 
that can be deleted from each party's outcome. Therefore, we only need to 
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consider functions /(s) without an n-partite linear part. By lemma[T| we know for 
functions /(s) without an ra-partite linear part, /(O, 0) = /(O, S2) = f{si, 0) = 



for all si and S2- So repeatedly applying (2.24) gives 



p(m2 = [-x]d\s2) = pirrii = x\si = 0) 

= p{m2 = [-x]d\s2 = 0) 
= p(mi = x\si) 

= p{m2 = [f{si,S2) - x]d\s2) (2.25) 

for all X. Repeated iteration implies for the ath iteration, 

p{m2 = [-x]d\s2) = p{m2 = [a/(si, S2) - x]d\s2) (2.26) 

for all a G Z^. The function /(s) is non-n-partite linear so there must be at least 
one value of {si,S2} where /(si,S2) is non-zero. Since d is prime, a/(si,S2) 
takes on all values in Z^, therefore the marginals are p(m2|s2) = dr"^ for all 
771-2. If the marginals are uniformly random for one particular input S2, because 
p(m2 = [-x]rf|s2) = p{mi = x\si = = p{m2 = [-x]d\s2) for S2 / S2, they will 
be uniformly random for all inputs. 

Therefore, by the non-signalling conditions p{mi\si, S2) = p{mi\si, S2) = 
and p(m2|si, S2) = p{m2\s[, S2) = d~^ implying p(mi|0, S2) = p{rni\0,0) = d~^ 
and p{m2\si,0) = p{m2\0,0) = d~^; the marginals for all s must be completely 
random. Applying equation ( |2.24 ) implies that p{mi,m2\si, S2) = d~^ for all m 



such that [mi + m2]d = /(s). □ 



2.3.2 Multipartite Generalisations of the PR box 

Lemma [3] shows that for every vertex of V outside of C for n = 2 and d being 
prime, there is only one non-signalling probability distribution compatible with 
this vertex. As a corollary, V captures a lot of the structure of MS but with the 
space of statistics considered being smaller. We now go further and show that 
this one-to-one correspondence exists for n > 2. 

Previous work has explicitly found the vertices of J\fS for n = 3, Cj = 2 
for all j and d = 2 |Pironio2011] . This work revealed that multipartite non- 
signalling probability distributions can have an extremely complicated and un- 
intuitive structure. For more general scenarios, very little is understood or been 
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investigated. Our approach, culminating in the following result, shows that cor- 
relators can give an insight into the multipartite structure of AfS. 



that is non-bipartite lin- 



Theorem 4. For every function f : 
ear, the only non- signalling distribution compatible with the corresponding vertex 
pik\s) 



Sjf^g'^ in V is 



p{m\ s) 




otherwise. 



(2.27) 



Proof: As well as the above distribution of the form of (2.8) but for bipartite 



linear functions /(s), we can explicitly construct another non-signalling probabil- 



ity distribution other than (2.8). This distribution can produce the corresponding 
vertex p{k\s) = (5^^^^^ of V for a bipartite linear function /(s) and is 

= <! and E,^^. mj]d = f2{s^'^'''^'^), (2-28) 

otherwise, 



since a bipartite linear function can be written as 



(2.29) 



for all functions /i : ©jg j and /2 : 0jg j-c '^cj for strict sub-set 

J and complement J^. The distribution is non-signalling across the partition 
as well as amongst the parties in the sub-set since in the sub-set it has the form 



(2.8). 



This, therefore, leaves non-bipartite linear functions and their corresponding 
non-signalling probability distributions. As mentioned, every partition into J 
and J''^ can be seen as a situation with two parties, where each side of the parti- 
tion makes a choice from cj = Y\j,^j Cj and cjc = Ylj^jc Cj inputs respectively; 
each partition also adds all their outputs togethers modulo d to obtain collective 
outputs mj = [J2jeJ ^J^d. and mjc = [^j^^jc mj]d respectively. As a result. 
Lemma jsj now applies and if the resource produces p{k\s) = 5^^^^ for /(s) be- 
ing a non-bipartite linear function, for all partitions into and ^7^, then we 
obtain p{mj\s^^^^'^'^^) = p{mjc\s^^^^'^'^''^) = d^^. This means all output strings 
ij^{j\j<^J} and m^-^l-^^^^J' for all strict sub-sets occur with equal probability, unlike 
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the distribution in (2.28). This necessarily results in the distribution of the form 
(2.8) thus proving the theorem. □ 



The uniqueness relation between vertex of V and a distribution in NS says 
something about the vertices olNS. This results from the extremality of vertices 
of V and the following result that says all non-signalling probability distributions 
that produce a vertex of V must form a face of NS. The uniqueness result of 
Theorem |4] then collapses the face to a single vertex. 

Proposition 5. Every non- signalling probability distribution that produces a ver- 
tex of V forms a face of MS . 

Proof: Every non-signalling probability distribution p(m|s) can be written as 
a convex combination of the set E of extreme points of A/'5, i.e. 

p(m|s) = '^p{E)pE{m\s) (2.30) 

E 

where is a vertex distribution oi MS and p{E) > and J2eP(^) ~ ^■ 

Of the set E, a sub-set of extreme points E' will each result in the same vertex 
of V, and their convex combination will always result in a vertex kE of "P. The 
region of distributions in MS which is formed by the convex hull of extreme 
points in E' is called £. 

First, we will point out that £ has no points in the interior oiMS and elements 
of £ are only on the boundary (i.e. surface) of MS. If we take the convex 
combination of an extreme point p£;/(m|s) in E' and an extreme point p^/(m|s) 
in the set of extreme points not in E' , then we have the convex line: 

qpE' (mis) + {1 - q)pp>{ni\s), (2.31) 

for < g < 1. If f has any elements in the interior of MS then £ is intersected 



by at least one of the convex lines of (2.31 ) for q or q 1. However, ii q ^ 1 



then this means that a probability distribution cannot result in the deterministic 
correlator kE in V, thereby leading to a contradiction. Therefore £ must lie in 
at most a facet of MS because if it lies on one or more facets, then there will 
necessary be interior points of MS in £. 

Finally, we now show that if iS is a A-dimensional sub-space of MS, it does 
not lie in Af, a A'- dimensional sub-space (or A'-face) of MS where A' > A. As a 
result, £ must be a face oi MS. If £ lies in a larger space X, then X has at least 
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one more extreme point than E\ this would mean that points in E can be written 
as a convex combination of extreme points in E' and not in E' . A contradiction 
again emerges as we would not obtain a deterministic correlator kE i'n V. □ 

Proposition 6. A vertex p{k\s) = ^^(^g^ of V corresponding to f{s) being a 
non-bipartite linear function results from a single vertex ofNS. 

Proof: Since there is only a single non-signalling probability distribution re- 
sulting in the vertex p{k\s) = of V for /(s) being a non-bipartite linear 
function, the region £ from the proof of Proposition 1 will necessarily consist of 
one extreme point. Therefore, £ becomes a 1-face or vertex. □ 

The space of all possible correlators V, uniquely captures properties of a full 
probability distribution that only satisfies special relativity. The study of MS 
has been motivated recently by foundational issues of what distinguishes quan- 
tum physics from something unphysical (e.g. |Pawlowski2009j ) . Vertices of J\fS 
have also been studied in the context of being an information theoretic resource 
|Barrett2005b| . Possession of particular resources that produce a vertex of MS 
not achievable with LHV or quantum resources (e.g. PR boxes) can lead to an 
information processing advantage in certain tasks (e.g. communication complex- 
ity |Brassard2006j ). It has also been suggested that PR boxes can be seen as a 
unit of non-LHV correlations (often abbreviated as "non-locality") , though there 
is evidence both for and against this suggestion |Barrett2005c| . The fact that 
the space of correlators captures generalisations of the PR box (with respect to 
extremality of MS) motivates the study of correlators as a smaller-dimensional 
problem revealing more general structures. 

The bipartite linear functions are not only of relevance to Proposition |6] but 
also of relevance to the next section. In the next section, we discuss a generali- 
sation of correlations discussed by George Svetlichny |Svetlichnyl987| ; these are 
correlations that exceed LHV correlations but do involve the space of all possible 
correlations. Interestingly, as Svetlichny has shown, these correlations do not 
fully capture all quantum correlations. 

2.4 Svetlichny Correlations 

George Svetlichny suggested an extension to the standard model of local hidden 
variables in the many-party scenario. More specifically, Svetlichny introduced 
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the scenario where there are three parties, and two parties are allowed to share 
whatever correlations they wish, but they are restricted to sharing only an LHV 
with the third party |Svetlichnyl987| . Therefore if parties 1 and 2 can share 
whatever correlation they wish (it could even not respect special relativity) , then 
party 3 only shares some local hidden variable A G A with 1 and 2, to obtain the 
following distribution: 

p(m|s) = / p{X)dXp{mi,m2\si,S2,X)p{m3\s3,X), (2.32) 
Ja 

with the probability distribution p{X)dX over A with Jj^p{X)dX = 1. There is no 
reason to privilege some parties over others and we allow permutations of parties 
so labels can be swapped, i.e. {1, 2, 3} — )• cr{{l, 2, 3}) and a is just a member of 
the permutation group. 

In full generality, we can allow probabilistic combinations of distributions of 



the form (2.32) but with permutations of parties to give 

p(m|s) = pi^2 / Pi,2WdXp{mi,m2\si,S2,X)p{m3\s3,X) 
J A 

+Pi,3 / Pi,3{X)dXp{mi,'m3\si,S3,X)p{m2\s2,X) 
J A 

+P2,3 / P2,3WdXp{m2,m3\s2,S3, X)p{mi\si, X), (2.33) 



A 



with Pi J and Pij{X) for i, j £ {1,2,3} and i ^ j being probabilities for a 
particular permutation such that pi^2 +Pi,3 +P2,3 = 1- Therefore, all Svetlichny- 



type correlations in the form of (2.33) are in a sub-region of J- that is a convex 



polytope Sjr; the extreme points of Sjr are distributions of the form (2.32) but 
with both probabilities p{mj,mk\sj, s^) and p{mi\si,X) being deterministic for 



2.4.1 Three-party Generalised Svetlichny Correlators 

Since Sjr is a convex polytope, it will be defined as the intersection of half-spaces 
defined by a set of linear inequalities in analogy with the facet Bell inequalities. 
Svetlichny actually originally described his set of linear inequalities of correlators. 
We shall now take this original approach and describe Svetlichny correlations 
in terms of correlators where S is the space of Svetlichny correlators for three 
parties. The following result captures this space S in terms of the description of 
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functions tliat we have used in the last two sections. 



Proposition 7. The space S of Svetlichny correlators for three parties is the con- 
vex hull of vertices p{k\s) = (5^^^^ ofV corresponding to bipartite linear functions 



Proof: If we take a probability distribution of the form in (2.32), then it can 
itself be written as a convex combination of deterministic probabilities of the 
form 

for the maps : Ijcj x^c^ '^d>^'^d and g^ : Ij^ — ?• I^d with j ^ k ^ I ^ {1, 2, 3}. 



The probability in (2.34) is defined for all possible maps g^ and g'^, therefore we 



can rewrite (2.32) as a convex combination of these deterministic probabilities 



and permutations of {1, 2, 3} to give 



{i,fc,«}efT{l,2,3}gi,g2 

where Pgi,g2 > is defined over all maps such that J^^i g2Pgi^g2 = 1. Therefore 
Sjr is the convex hull of extreme points defined by all possible maps of the form 
g^ and g'^ for all different labellings of parties. 

For correlators, we take the sum modulo d of all outcomes. Taking the 
sum [mj + mk]d results in all maps of the form g^ now becoming all maps 
of the form : Zc^ x Z^, Zd- Finally, the sum of all outcomes is now 
[mj + ruk + rni]d = /(s) = [f^{s) + /^(s)]^ where = g^. These functions 
/(s) are by definition bipartite linear functions and so S is the convex hull of 
correlators resulting from bipartite linear functions. □ 

A facet Svetlichny inequality is a linear inequality that defines a facet of S 
in analogy with the facet Bell inequalities. One of the original facet Svetlichny 
inequalities for the setting with three parties, cj = d = 2 for all j can be written 
in terms of correlators as |Svetlichnyl987| 

p(l|000) +p(l|001) +p(l|010) -p(l|011) 
+p(l|100) -p(l|101) -ij(l|110) -p(l|lll) <2. (2.36) 

Interestingly, despite the fact that we allow any possible correlation to be shared 



between two of the three parties, correlators in Q still violate (2.36) with the 
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quantum (Tsirelson-Svetlichny) upper bound 1 + \/2 |Svetlichnyl987l . Whilst 
quantum correlator vectors may be outside the space S, this Svetlichny polytope 
is not strictly smaller than the space of quantum correlators, i.e. some vertices 
of S are not achievable with quantum correlators. 

2.4.2 Multipartite Svetlichny Correlators 

The above discussion has been restricted to Svetlichny's original work for three 
parties. It is natural to ask how this approach generalises to more than three 
parties. One could suggest a model where we allow only at most two out of 
n parties to share whatever correlation they wish and then share local hidden 
variables with the other (n — 2) parties. We will go further, and in line with 
other approaches (e.g. jBancal2009| IBancal20lf] ). partition n parties into two 
sub-sets and parties in each of the two sub-sets is allowed to share whatever 
correlations they wish (signalling or otherwise). Then each partition only shares 
a local hidden variable A € A (with probability distribution p{X)dX) with the 
other partition to obtain correlations of the form: 

p(m|s)= / p(A)dAp(m^-''l-''e^>|s^^'l^'^^>,A)p(m^^'l^'e^'>|s^-''l^'^^'>,A), (2.37) 
J A 

where n parties are partitioned into sub-sets J' and J^. 

As with three parties, we allow convex combinations of distributions in (2.37) 
for all (2"^-*^ — 1) different partitions into strict sub-sets and J'^. Correlators 
resulting from this generalised Svetlichny model can again be expressed as a 
convex polytope as a generalisation of Proposition [7j the following result now 
captures this generalisation. 

Theorem 8. The space S of generalised Svetlichny correlators for n parties is 
the convex hull of vertices p{k\s) = (^jj-^) of V corresponding to bipartite linear 
functions f{s). 



Proof: The correlations in (2.37), as with the three-party case, can be written 
as a convex combination of deterministic probabilities resulting from determin- 
istic maps labelled g^ and g^: 

P(m|s) = 2^ Pgl,g25^i(^{,|,g^})(5^2(,0|,ej-}), (2-38) 

where g^ : ^j^^j^c, ^ z|f ' and g^ : ®j^jcZc^ ^ z|f' ' with Pgi^g2 > and 
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X^gi g^Pg^,g'^ — 1- Now if we take the sum modulo d of all outcomes then we 
obtain the following correlators: 



p{k\s) = ^ p;i,^2<^f^i(,)_,^.(,)]^, (2.39) 

with all possible maps of the form : '^Cj — ^ and : 0jg j-c l^cj — ^ 

and the distribution P/i ja > such that p Pf^.p = 1- 



If we allow all possible correlators of the form (2.39) for all possible partitions 
into J and J"^ then S is the convex hull of all deterministic correlators corre- 
sponding to functions /(s) = [/-"^(s) -|-/^(s)]rf. These are all of the bipartite linear 
functions by definition. □ 



The structure of bipartite linear functions gets translated from the three-party 
case to the n-party case. Despite the fact that we allowed signalling correlations 
within partitions of the n parties, we can impose the non-signalling conditions 
on all parties once again. This means that even within a sub-set of parties, the 
correlations they share must satisfy special relativity. Interestingly, even if we 
apply this restriction, the space of Svetlichny correlators for many parties is still 
S as defined by Theorem |8j This is simply because all deterministic correlators 
(or vertices of V) can be achieved with non-signalling probability distributions 
NS. All the deterministic correlators associated with bipartite linear functions 
can be achieved with probability distributions in NS. 

If one assumes that all correlations satisfy special relativity, then non-signalling 
correlations not achievable with Svetlichny-type correlations are said to be "truly 
n-partite non-local" |Bancal20091 IBarrett2005bj . They are "non-local" in the 
sense that across all partitions of n parties, the correlations of the parties are not 
described by the parties sharing a local hidden variable. Therefore, the vertices of 
V that are not associated with bipartite linear functions can only result from truly 
n-partite non-local correlations. Of the non-signalling correlations in NS, then 
for each of these vertices of V there is one truly n-partite non-local distribution, 
or vertex of NS as described by Theorem [4| 

Instead of allowing all possible correlations within a sub-set of all parties or just 
allowing non-signalling correlations, one could allow correlations "in-between" 
that allow some, but not all forms of communication. Indeed, these issues have 
been investigated by Barrett and Pironio |Barrett2011aj . If one is only concerned 
with correlators, then whatever form of restricted, or unrestricted, communica- 
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tion within a partition of all parties, the space of Svetlichny-type correlators is 
S as described by Theorem [8| The space of correlators is conserved and we can 
always discuss the possibility of distinguishing between a model that permits, in 
part, an LHV description and something inconsistent with this model. 

2.5 Chapter Summary 

In this chapter we have motivated and presented the study of Bell correlators 
in a natural generalisation of the Bell-CHSH test. We have also discussed how 
finding Bell inequalities that define the space of LHV correlations/correlators is 
in general a hard problem. Motivated by this, studying correlators instead of a 
full probability distribution, we reduce the size of the problem, if not reducing 
the general hardness. 

The language of stochastic maps and functions has been key to describing the 
correlators resulting from particular theories (both physical and non-physical). 
This description of correlators in terms of computational expressiveness is key to 
the central results of not only this chapter, but this entire thesis. To summarise, 
each potential theory has its own computational expressiveness and character- 
ising this gains an insight into "which computations the theory is capable of 
performing". These ideas will be generalised in subsequent chapters to take into 
account data processing in Bell tests but the computational expressiveness in- 
sight will be key. Importantly, this computational point-of-view on correlators 
has allowed us to characterise the well-studied structures of LHV correlators in 
a new language. 

This interpretation of correlators in terms of computation has also produced 
new results. We showed that vertices of the polytope of all correlators can cor- 
respond uniquely to vertices of the non-signalling polytope. As well as this, we 
have described the space of Svetlichny correlators in terms of computational ex- 
pressiveness. Again, this description of Svetlichny correlations gives us a new 
insight into well-studied areas of research. 
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3 Constructing Bell Inequalities and 
Quantum Violations 



In the previous chapter, we focussed mostly on the description of the local hid- 
den variable (LHV) polytope in terms of its vertices. Now we shift to a facet 
representation of the LHV polytope in terms of the facet Bell inequalities: linear 
inequalities defining the facets of this polytope |Finel982l Pitowskyl989| . If a 



correlator is outside of the polytope it must necessarily violate at least one of 
these inequalities. However, recall that finding them is a hard problem. 
A Bell inequality is a linear inequality of the following form 

(d-i) 

Yl A,sP(A:|s) < jc, (3.1) 

s fc=l 

for some real coefficients Pk,s where 7£ is the tight upper bound for all LHV 
correlators in £ We introduce the vernacular that a "Bell expression" is the 
left-hand-side of ( |3.1[ ) . We make the distinction between Bell expression and Bell 
inequality as we can substitute correlators not in C into a Bell expression and 
they could violate a Bell inequality. 



We optimize over values /3^..s and in (3.1) to find the facet Bell inequalities. 
But this optimization, in the worst case, is a hard computational task. In this 
chapter we look for these facet Bell inequalities but only manage to find them for 
a select number of scenarios on a desktop PC using Polymake |Polymake2000l . 
We give some indications of the possible connections between the violations of 
facet inequalities and the possibility of performing a non-n-partite linear function. 
However, this connection is not completely clear as the structure of £ is in general, 
rather complicated. On the other hand, we review the results of Werner, Wolf, 



Zukowski and Brukner [Werner 200 It IZukowski2002 in the n party, 2 input, 2 



^In the literature, tight Bell inequalities are synonymous with facet Bell inequalities. Our use 
of the word tight reflects that the Bell inequality intersects the LHV polytope at (at least) 
one of its extreme points. 



68 



output scenario and relate the structure of £ in this scenario to a particular class 
of Boolean functions. 

In spite of the difficulty in understanding the structure of £ and even finding 
the facet Bell inequalities, we find a general class of Bell inequalities that have 
a natural computational perspective. We call these inequalities non-trivial Bell 
inequalities. They are non-trivial in the sense that they provide a separation 
between all possible correlators in C and all possible correlators in V . We go 
on to relate these inequalities to an information processing paradigm called a 
"non-local game" |Cleve2004] . We then use the construction of a non-local game 
to derive more of these non-trivial Bell inequalities. 

Finally in this chapter, we make interesting connections between the discus- 
sion of Bell inequalities and Measurement-based Quantum Computing (MBQC) 
|Raussendorf200 1 \ IRaussendorf2003l |Briegel2009[ IJozsa2006] . In particular, we 
show that a sub-class of computations in Briegel and Raussendorf 's construction 
of MBQC |E,aussendorSnnT| IRaussendorflHoa] can be cast as non-local games. 
Through the language of non-local games, we relate these quantum computations 
to non-trivial Bell inequalities. All of these connections truly highlight the rich 
interplay between the foundations of quantum mechanics and its applications. 



The original material in sections 3.1 and 3.2 along with subsections 3.3.1 and 



3.3.2 were completed in collaboration with Joel Wallman and Dan Browne and 
published in part as |Hoban2011c| . The subsections of 3.1.5 and 3.2.3 consist 



of rederivations of results in |Werner2001j with a focus on the computational 



description of correlators. The original work in subsection 3.3.3 and section 3.4 



were done in collaboration with Earl Campbell, Klearchos Loukopoulos and Dan 
Browne and published as |Hoban2011a| . 



3.0.1 Notation 

From now on, we simplify the scenarios of Bell tests that we consider by having 
the number of inputs at each site being the same, i.e. Cj = cy for all j ^ j' . We 
introduce the notation (n, c, d) to describe Bell tests with n parties, c inputs and 
d outputs at each site. We also carry over the notation from chapter [2]of £, 5, Q 
and V being the LHV polytope, the Svetlichny polytope, the space of quantum 
and all possible correlators respectively for each scenario {n,c,d). 

The majority of the remainder of this thesis will be devoted to the study of the 
(n, 2, 2) scenario. We privilege this scenario by assigning it a particular notation 
not shared by any others. Since the number of the inputs at each site is the same. 
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inputs are always s G ©j=i = • As a result of this simplification, we will 
no longer use the notation to describe the Cartesian product of groups Zc^. . 
We will use to denote summation modulo 2, i.e. = •••]2- This notation is 
used only in the (n, 2, 2) scenario along with the notation © to describe addition 
modulo 2, i.e. © = [... + ...Jg. Modulo 2 multiplication between elements in Z2 is 
exactly multiplication of these elements for standard arithmetic. Therefore, for 
the (n, 2, 2) scenario and only this scenario we re-write expressions in modulo 2 
arithmetic in terms of this notation. For example, the expression [xiX2 + X3 + Ijg 



becomes xia;2©X3©l, and, 



2 



becomes ( 0^=i a^j ) © X5 © 1 



re notation from the previous 



For scenarios other than (n, 2,2), we retain t.: 
chapter. That is, all arithmetic in [■■■]^ is modulo x arithmetic. Even if either 
c or d is equal to 2 (but not both), we will use the notation [...]2 for modulo 2 
arithmetic. 

3.1 Facet Bell Inequalities 

In this section, we will discuss the facet Bell inequalities for particular (n, c, d) 
scenarios. We used the Polymake package of algorithms to find the facet Bell 
inequalities for a small number of cases |Polymake2000| . These are the {n,c,d) 
scenarios where finding the inequalities was computationally tractable on a desk- 
top PCj^ We will show that these inequalities can be grouped together into 
symmetries, or in group theoretical terms, orbits; these orbits are generated by 
operations that preserve the region C |Pitowskyl991| . For the number of (n, c, d) 
scenarios studied, we will describe elements in these orbits. Then we discuss the 
facet Bell inequalities for the (n, 2, 2) scenario; there is a closed-form expression 



for these inequalities [Werner 200 H IZukowski2002 



In Table 3.1 we have listed the number of facet Bell inequalities for a few sce- 
narios that could be computed using Polymake. Included in the number of facet 
Bell inequalities are the normalization and {d — l)c" positivity inequalities 
that define V. Despite these dc" inequalities, there are still a significant number 
of inequalities remaining. On the other hand, Pitowsky has shown that correla- 
tion polytopes have certain symmetries |Pitowskyl991| . These symmetries are 
generated by operations on the inputs and outputs as well as permutations of 
parties. The group of these symmetry operations generates orbits of facet Bell 



^iMac with 2.4 GHz Intel Core 2 Duo (TM) Processor and 2 GB 800 MHz DDR2 SDRAM. 



70 



n 


c 


d 


# Vertices 


# Facet Bell inequalities 


2 


2 


2 


8 


16 


2 


2 


3 


27 


66 


2 


2 


4 


64 


216 


2 


2 


5 


125 


1020 


3 


2 


2 


16 


256 


3 


2 


3 


81 


125,412 


2 


3 


2 


32 


90 


2 


4 


2 


128 


27,968 



Table 3.1: A table of number of facet Bell inequalities for each scenario {n,c,d) 
and the number of vertices for the LHV polytope. 

inequalitie^ Every facet Bell inequality in each orbit can be mapped to every 
other inequality in that orbit via these symmetry operations. Therefore, we do 
not need to consider every single facet Bell inequality for each (n, c, d) scenario 
but only one inequality in each orbit. In the following subsection we consider 
these symmetry operations. 

3.1.1 Symmetries of the LHV Polytope 

Pitowsky has shown that given a facet Bell inequality for an LHV correlation 
polytope, we may find more inequalities by some simple operations on data m 
and s |Pitowskyl991| . These operations G map from the set £ of extreme points 
of C to themselves, i.e. G : £■ ^ 8. By convexity, we only need to consider the 
extreme points. The symmetry operations G that produce these maps are the 
following: 

1. permutations of parties - {sj, Sj, s„} — )• {sj/, Sji, where k' = a{k) 
is an element of the permutation group S"" of order n; 

2. relabeling of measurement scenarios - sj — )• sj + aj for some aj G Zc; 

3. relabeling of measurement outcomes - rrij — t- mj+b{sj , j) where b{sj , j) £ TLf^. 

The operations G and their products GG' (for either G ^ G' oi G = G') form a 
group G such that G G G. There are n! permutations of n parties and c" ways of 
relabeling measurement scenarios. Since for each input Sj we add a value b{sj), 
for each input s, 5(s) = X]j=i is added to X^"=i rrij. There will be at most 



^We are using the terminology used by Werner and Wolf [Werner2001j . 
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n 


c 


d 


# Facet Bell inequalities 


# Orbits 


2 


2 


2 


16 


2 


2 


2 


3 


66 


2 


2 


2 


4 


216 


4 


2 


2 


5 


1020 


5 


3 


2 


2 


256 


5 


3 


2 


3 


125,412 


63 


2 


3 


2 


90 


2 


2 


4 


2 


27,968 


15 



Table 3.2: The number of orbits for each scenario (n, c, d) under the symmetry 
operations described in the text. One of the orbits for each scenario 
is the orbit of normalization and positivity conditions. 



d^"' values of b{s). In total, there are at most nlc^d^^ elements of G in order for 
there to be closur^ 

The n-partite linear functions are closed under all of these operations. Using 
the facet-defining condition, the vertices of C that saturate a facet Bell inequality 
must be equivalent to another set of vertices in £; this new set also saturates a 
facet Bell inequality. In group theoretic terms, if we have one facet Bell inequal- 
ity and perform all possible sequences of operations G, then the set of facet Bell 
inequalities produced by these operations forms an orbit (see the use of termi- 



nology in [Werner2001j ). In Table 3.2 we have listed the number of orbits for 



each of the scenarios in Table 3.1 These orbits were numerically found using a 
search algorithm on all of the facet Bell inequalities. For each instance of (n, c, d), 
it was found that one of the orbits consists of the normalisation and positivity 
inequalities; we call this orbit the "trivial orbit". Orbits which do not include the 
normalisation and positivity inequalities are called "non-trivial orbits". 

For each of the (n, c, d) scenarios, we only need to consider one inequality from 
each orbit. For the (2,2,2), (2,2,3) and (2,3,2) scenarios, there is only one 
non-trivial orbit. In each of these scenarios, we then only need to consider one 
inequality. If one of these inequalities in each orbit is violated by a quantum 
correlator, then the above symmetry operations can be applied to that quantum 
correlator so that it will violate every other inequality in said orbit. The possibil- 
ity of violation of facet Bell inequalities with quantum correlators is, as a result, 



*In principle, the number of operations could be smaller as the values of b(s) may be overcom- 
plete for all possible transformations. For example, in [Werner2001) the cardinality of G is 
n!22"+i. 
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rendered easier to study. 

For the (2, 2, 2) scenario, as Fine has also shown in |Finel982] . the only facet 
Bell inequality we need to consider is the CHSH inequality |CHSH1969j . In the 
following subsection we consider other facet Bell inequalities for n = 2. We 
show that the CHSH inequality and a generalisation in d (for c = 2) of this 
inequality (the CGLMP inequality |CGLMP2002j ) between them generate a lot 
of the structure of £. In later subsections 13.1.41 and 13.1.51 we will discuss the 
tripartite and multipartite scenario (i.e. for n > 2). First we briefly introduce 
some new notation. 



3.1.2 Notation for Bell inequalities 

We now introduce a piece of notation to describe all Bell inequalities. If we write 
vectors k of correlators that have elements p(A;|s), we can express an inequality as 



an inner product. The real pre-factors /3fc^s of (3.1 ) are elements of an (d — l)c"'- 
length row vector b G M^'^"-^)'^". Therefore, every inequality results from the 
Euclidean inner product b ■ p < of these two vectors. 

We adopt a convention to order the elements Pk,s of b from left-to-right starting 
with /3i^o and ending with f3(^d-i),c with c = {(c— 1), (c— 1), (c— 1)}, the digit- 
string of all inputs being (c — 1). To be explicit, each digit-string s E Z" can be 
written as an integer in Z, the set of positive integers. Digit-strings s G can 
be ordered in terms of these integers in Z. For example for c = 2, the digit-string 
s = {1,0,0} corresponds to the integer 4 and for c = 3 the same digit-string 
is equal to 9. We order elements /3i^o from left-to-right for increasing values of 
k £ for each ordered value of s. 

To give a concrete example, the CHSH inequality |CHSH1969j 

p(l|00) -hp(l|01) +p(l|10) -p(l|ll) < 2, (3.2) 

corresponds to the vector b = (/3i,{o,o}> /3i,{o,i}, /5i,{i,o}, = (1,1>1,-1)- 
We will employ this notation for specific values n, c and d. For brevity, in more 
general expressions we may choose to write the inequality in terms of the sum in 



(3.1 ). In the next subsection we will write both in terms of the sum in (3.1 ) and 



the vector notation introduced above. 
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3.1.3 Bipartite facet Bell inequalities 

In this subsection we will restrict ourselves to the n = 2 scenario for particular 
values of c and d. The CGLMP inequality |CGLMP2002] is a facet Bell inequahty 
for all d in (2, 2, d) scenarios, as shown by Masanes |Masanes2003j . For all d, this 
inequality can be written as 

d-i 

CcGLMP = dxp(l|0,0)-j;(-l)^i+^Ml|s)+^(-l)^i+^2 J2id-k-l)p{k\s) < d. 

s s k=2 

(3.3) 

The CHSH inequality is exactly this inequality when d = 2. For d = 2, 3, the 
only non-trivial orbit is generated by the CGLMP inequality. Whilst for d = 4, 5 
the CGLMP inequality generates one of (d — l) non-trivial orbits. For all possible 
correlators in V, the maximal value of the left-hand-side of the CGLMP inequality 
is 2d— 1, thus violating it. In fact, for all d, this maximal violation of the CGLMP 
is obtained by a vertex of V corresponding to the function /(s) = [siS2 + l]d, i-e. 
the correlator p(k\s) = , n . 

In the (2, 2, 2) scenario there are 2^ — 2^ = 8 non-n-partite linear functions 
and also 8 inequalities in the non-trivial orbit of the CHSH inequality. This is 
no coincidence as every Bell inequality in this orbit in maximally violated by a 
vertex of V corresponding to a non-n-partite linear function. This also occurs for 
the (2, 2, 3) scenario where there are 3^ — 3^ = 54 non-n-partite linear functions 
and 54 inequalities in the orbit of the CGLMP inequality. It can also be checked 
that every inequality in this orbit is violated by a different non-n-partite linear 
function. 

For (2,2,4), one of the orbits is generated by a generalisation of the CHSH 
inequality 

CL, = E(-l)'''' + ^^(318)] < 2. (3.4) 

s 

This expression is essentially the CHSH inequality if each party groups their 
outcomes nij into modulo 2 terms. Since 1 mod 2 is equal to 3 mod 2, each party 
just maps from modulo 4 arithmetic to modulo 2. For all possible correlators in 



V, the Bell expression in inequality (3.4) achieves the value of 3. This value is 



achieved for two vertices of V corresponding to functions /(s) = [siS2 + 1]2 or 
/(s) = [siS2 + 1]4. Therefore the one-to-one relationship between inequality and 
maximal violation from a vertex of V breaks down for d = 4 (and also d = 5). 
This is confirmed by the number of facet Bell inequalities in non-trivial orbits 
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for (2, 2, 4) being 216 — 26 = 200 whereas the number of non-n-partite hnear 
functions is 4"^ - 4^ = 192. 

The third and final non-trivial orbit for (2, 2, 4) is generated by the following 
inequality (expressed in the notation described earlier): 

C^=4 = (1, 2, 1, 1, 2, 1, 1, 2, 1, -1, -2, -1) • ^ < 4. (3.5) 

It is worth noting that this can be constructed by adding 2(— l)*i*2p(2|s) to 



the left-hand-side of the previous inequality (3.4). The maximal value of 6 of the 
left-hand-side (i.e. Bell expression) results from the vertex of V corresponding 
to the function /(s) = [2siS2 + 2]4. 

For (2,2,5), there are 4 non-trivial orbits. One of these is generated by the 
CGLMP inequality and the other three are given by 

Xi = ^ (6, 2, 3, 4, 4, -2, 2, 1, 4, -2, 2, 1, -4, 2, -2, -l)-k< 5, 

12 = (3,1,-1,-3,2,-1,-4,-2,2,-1,-4,-2,-2,1,4,2) •fc< 5, 

Xs = (2, -1, 1, -2, 3, 1, -1, 2, 3, 1, -1, 2, -3, -1, 1, -2)-k< 5. (3.6) 

The inequality for the Bell expression Xi and the CGLMP inequality are max- 
imally violated by the vertex corresponding to /(s) = [siS2 + Ijs- The Bell 
expressions X2 and X3 are maximally violated by the vertex corresponding to 
/(s) = [2siS2 + 1]5. As we can seen there is a corresponding function for each of 
these inequalities that leads to a maximal violation. 

We now consider scenarios with c > 2 but with d = 2. As can be seen from 



Table 3.2 for the (2, 3, 2) scenario there is only one non-trivial orbit. The Bell 



inequality generating this orbit is another generalisation of the CHSH inequality: 

2 

c,=3 = Y.(-iy''' 11(^0^ + K')pm < 2. (3.7) 

s j=l 

For the (2, 4, 2) scenario, three of these non-trivial orbits are forms of the CHSH 
inequality embedded in the larger number of inputs. For completeness, we have 



listed all 14 Bell inequalities in Table 3.3 We now explicitly write out one of 
these inequalities: 

2 

CU = ^i-^r'' 11(^0^ + ^i)P(M^) < 2- (3.8) 

s j=l 
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Table 3.3: The facet Bell inequality expressions that each belong to a particular 
non-trivial orbit for (2,4,2). Each row corresponds to a particular 
inequality belonging to a different symmetry class. Each column of 6 
is an element of this vector that forms an inner product with p. The 
LHV upper bound for inequalities B\ to B^ is 8 and 4 for B^ B\\. 

which is almost exactly the same as C^^g. The other two inequalities, C^^g and 
C^^3 are similar to this inequality except with altered delta functions for C^^g via 
the substitutions: 

n(5^^- + <5^0 ^ (-^o^ + + <^r), (3.9) 

J=l 

and for C^^g: 

^{t^ + ?>'l) ^ {bl^ + b^iWo- + 5^^). (3.10) 

We can see that the CHSH inequality generates a lot of the structure of the LHV 
polytope in the bipartite scenario. In general though, we have given some insight 
into the richness of structure of L. This might give some indication why finding 
the facet Bell inequalities is a complicated task. All of this discussion is even 
before we consider more than 2 parties. In the following subsection we discuss 
the n = 3 case. Despite not having as many results in this scenario due to the 
scaling of the size of M^''"^)'^" in n, we show some of the structure of L can be 
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obtained from the n = 2 scenario. 



3.1.4 Tripartite facet Bell inequalities 

We have given an indication that facet Beh inequahties for n = c = 2 have 
a computational interpretation. Every facet Beh inequahty we have found is 
maximaUy violated uniquely by a vertex of V when d = 2, 3, and 5. In this 
sense the violation of a facet Bell inequality can quantify how computationally 
powerful a theory is. For situations with n > 2, this becomes more complicated 
even for n = 3 and c = d = 2. The Mermin inequality |Merminl990j which we 



introduced in the first chapter (see section 1.2) can be expressed as 



p(l|000) +p(l|011) +p(l|101) -p(l|110) < 2, (3.11) 

and forms a non-trivial orbit |Werner2001j . This inequality is maximally vio- 
lated by more than one vertex of "P. If expressed in terms of expectation values 
of measurements, it can be generated from the CHSH inequality by a form of 
substitution |Werner2001j . WW showed that all inequalities for (n, 2,2) can be 
generated by this substitution |Werner2001j . We now discuss a possible method 
of doing this for (3, 2, 3). 



Analogously to the Mermin inequality (3.11), we define a CGLMP inequality 
for three parties using the two party inequality. We have three parties but now 
we only consider non-zero terms in a Bell inequality when the third party's input 
is S3 = 0. For LHV correlators p(A;|si, 52, 0) the n-partite linear functions that 
can be achieved are /(s) = [aisi + a2'S2 + a3]3 with ai, 02, 03 S Z^^: then-partite 
linear functions on two variables si and S2- Since the CGLMP inequality is facet- 
defining for the region of LHV correlators for two parties, or variables si and S2, 
it is facet-defining for this space of the n = 3 correlators for S3 = 0. Then we 
can write the tripartite CGLMP inequality as 

CcGLMP = d X p(l|0, 0, 0) - J^(-l)^i+^Mlkl, 52, 0) 

s 

d-l 

+ Y,^-iy'^'^Y.^d-k-l)p{k\suS2,^) <d. (3.12) 



k=2 



For the case of (3,2,3), this tripartite CGLMP inequality is facet-defining and 
forms an orbit of 324 inequalities. There are 61 other non-trivial orbits for 
(3, 2, 3). Inequalities from each of these orbits can be found in the supplementary 
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material in |Hoban2011c| . Interestingly though, the Mermin inequality (3.11) 
above which can be rewritten as: 



p(l|000) +p(l|011) +p(l|101) -p(l|110) = ^(^,%,^(-l)^iXl|s) < 2, (3.13) 

s 

does not generalize directly to the (3, 2, 3) scenario. If we were to naively write 
the generalisation as 

CcGLMP = X p{i\o, 0, 0) - J] 6fi_^^^.^{-ir+'MM^) 



+S2]2 ^ 
s 

d-l 



+ J2^[!,+s,],i-^r^''Y.(d-k-l)p{k\s) <d, (3.14) 

s k=2 

then the right-hand-side is not c? = 3 in the (3, 2, 3) scenario but 2d — 1 = 5, the 
algebraic upper bound for all possible correlators and not just LHV correlators. 
This upper bound of 5 is achieved by vertices of V corresponding to the function 
/(s) = [siS2 + 1]3. However, if parties produce the n-partite linear function 
/(s) = [2si + 2s2 + S3 + 1]3 and the only non-zero terms in the above inequality 
occur when [si + S2]2 = S3, then /(s) = [2si + 2s2 + [si + S2]2 + 1]3 = [siS2 + 1]3- 
Despite the fact that some of the facet Bell inequalities can be obtained from 
bipartite inequalities, understanding the full structure of £ is a difficult task in 
general. For example, the straightforward substitution of the CGLMP inequality 
into expressions for (3, 2, 3) still leaves a large number of orbits without charac- 
terisation. On the other hand, C in the (n, 2, 2) scenario is well-understood as 



a hyperoctahedron | Werner 200 H IZukowski2002 . The facet Bell inequalities can 



be described in terms of Boolean functions where each facet inequality results 
from each particular Boolean function. In the following subsection we review 
the insight obtained by Werner and Wolf |Werner2001j as well as Zukowski and 



Brukner Zukowski2002 



3.1.5 Multipartite facet inequalities for {n,2,2) 

So far we have found facet Bell inequalities numerically. The size and hardness 
of the problem means that as n gets larger, finding the facet inequalities quickly 
becomes intractable on a desktop PC. Convex polytopes are generalisations of 
the polyhedra and the geometry of these objects has been studied for thousands 
of years |Grunbaum2003| . A natural question to ask is whether there are an- 
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alytical tools in convex geometry that can help us define C in terms of linear 
inequalities? This is not immediately obvious in the case of general (n, c, d) but 
the case of (n, 2, 2) has been amenable to this approach. Werner and Wolf (WW) 
independently with Zukowski and Brukner (ZB) have shown that in this specific 



case, £ is a hyperoctahedron |Werner200H IZukowski2002 . 

Out of preference, we follow the WW construction of facet Bell inequalities 
|Werner2001j . Augmenting this approach we will use a central result from the 
previous chapter that C is the convex hull of n-partite linear functions. For the 
(n, 2, 2) scenario, these n-partite linear functions are the linear Boolean functions. 
The linear Boolean functions are a class of functions that have existed in the 
study of computer science and propositional logic well before our usage here. For 
example, linear Boolean functions are generated in error correction such as with 
the Hamming code |MacWilliamsl977] . The following result demonstrates yet 
another application of the study of linear Boolean functions. 

Corollary 1. The space C of LHV correlators in the (n, 2, 2) scenario is the 
convex hull of linear Boolean functions. 

Proof: Since this corollary is a special case of Theorem [2] we just need to show 
that for the (n, 2, 2) scenario, all the n-partite linear functions are the linear 
Boolean functions. Linear Boolean functions /(s) for an n-length bit-string s 
can be written in terms of the Algebraic Normal Form (ANF) as: 

/(s) = |^0«iSij e6, (3.15) 

where aj, b £ {0, 1}. Whereas, an n-partite linear function g{s) in this scenario 
can be written as 

n 

5(s) = 05i(s,), (3.16) 

i=i 

for single-site map gj : Z2 — t- Z2. Crucially, as a special case, all single-site 
Boolean functions of this form can be expressed as gj{sj) = hj®aj6\^ = ajSj(Bbj 
since = Sj for aj, bj G {0, 1}. Therefore, take the sum modulo 2 of all of these 



maps and setting b = 0"=i bj returns the expression in (3.15). □ 



The above corollary is a rederivation of the LHV convex polytope that was de- 



rived by WW and ZB jWerner200H IZukowski2002 . However, this rederivation 
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is in terms of a language of computational expressiveness whereas the original 
derivation is in the language of expectation values of measurements with out- 
comes ±1. The "linearity" (in the Boolean function sense of the word) is not 
explicit but buried in the mathematical derivation of C. The language of compu- 
tational expressiveness sheds a new light on an old result and this new perspective 
will become central to a lot of discussion in this chapter; the next chapter will 
also have Corollary [T] at its heart. 

As mentioned above, both constructions due to WW and ZB use the expec- 
tation values of E(s) = p(0|s) — p(l|s) rather than the correlators themselves. 
However, due to the "law of the excluded middle" giving IE(s) = 1 — 2p(l|s), 
expectation values and correlators are in one-to-one correspondence. For brevity 
of reproduction of results, we will work in terms of E(s) and then map back to 
correlators |?(l|s) at a final stage. 

Taking on this notation, we construct all Bell inequalities in the (n,2,2) sce- 
nario in the following way [Werner 2001] : 



< 1, 



(3.17) 



such that the real coefficients /3s always give 1 for LHV correlators. We are just 
choosing a normalisation convention without loss of generality. By convexity we 
only need to consider the extreme points of C which correspond to the linear 
Boolean functions. We rewrite these extreme points ^(s)^; in terms of the ex- 
pectation values, i.e. IE(s)£; = '^ki~^)^^g{s) ~ (~1)^'"^'' where ^(s) is a linear 
Boolean function. We are also only interested in extreme points that maximally 
saturate the upper bound as these extreme points will define a facet. Putting all 



of this information together, we can rewrite (3.17) as: 



(3.18) 



where 7g(s) £ {0, 1} depends on the linear Boolean function. The linear Boolean 
functions can be written as g{s) = (©"^lajSj) ® b but the overall sign (— 1)* 



leaves (3.17) unaffected. Therefore we only need to consider linear Boolean func- 
tions with 6 = 0, thus leaving 2*^ such functions. In order to show that the 



inequalities in (3.18) are facet-defining, then we must form affinely independent 



2^-length vectors with elements (— 1)^^®^ for each s. To demonstrate affine inde- 
pendence we utilise the following lemma. 
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Lemma 9. For 2'^-length vectors g € M^" with elements g{s) being the non- 
constant linear Boolean functions, a set of (2" — 1) vectors g are linearly inde- 
pendent as long as no two vectors, g^ and (f, corresponding to two linear Boolean 
functions g^{s) and g^{s) respectively, have all elements g^{s) = 5^(5) © 1. 

Proof: We demonstrate linear independence by mapping linear Boolean func- 
tions from Z2 to M. Every linear Boolean function can always be expressed as 
fi'(^) — (©^^=1 '^j^j) ® ^ ^'^d is non-constant as long as at least one value of aj is 
non-zero. For two functions ^"'^(s) = ^'^(s) © 1, 6 is 1 for either of the functions 
and for the other. Mapping from Z2 to M, we can write ^^(s) © 1 as 

g^s)®! = 1-9^8). (3.19) 

Therefore if we prove that the (2" — 1) non-constant linear Boolean functions 
di^) — (©j'=i ^j^j) produce (2" — 1) linearly independent vectors g, then this 
holds if functions are g{s) © 1. This is true if the set of linear Boolean functions 
does not include two functions g^{s) and ^^(s) where g^{s) = g^{s) © 1 for all s. 

First, all variables Sj will produce vectors g that are linearly independent from 
all g resulting from by construction where k ^ j. As a shorthand, we say that 
a function is linearly independent from other functions if the associated vectors 
g are linearly independent. We show that linear Boolean functions dependent 
on more than one variable Sj are linearly independent. We start with the linear 
function si © S2 which can be rewritten as 

Sl © S2 = Sl -h S2 - 2siS2. (3.20) 

This expression is linearly independent from functions si and S2 due to the s\S2 
term being multiplicative. This function will also be linearly independent from 
all functions Sj © Sfe 7^ si © S2 due to SjSk being linearly independent from siS2- 
Having shown that all linear Boolean functions dependent on 2 variables and 1 
variable are all linearly independent from each other, we proceed inductively. For 
functions dependent on 3 variables Sj, eg. ^(s) = si © S2 ® S3, we can again map 
this function into standard arithmetic as 

Sl © S2 © S3 = Sl -I- S2 -I- S3 - 2(siS2 "I" S1S3 -|- S2S3) -|- 4siS2S3. (3-21) 

Again S1S2S3 is linearly independent from all terms SjSk for j ^ k and single 
variable terms sj, as well as all linear functions sj © © s/ 7^ si © S2 © S3. 



81 



Proceeding inductively for each function ^(s) = (0"=]^ '^j-^j) with q non-zero 
values of Uj, writing ^'(s) in standard arithmetic we have the product of these 
q elements of s. This product of q elements of s is linearly independent from 
all other products of q and q' < q elements of s. The linear Boolean function 
di^) = (©j=i ^j) can thus be written as 



0' 



1 



l_J|(l_2s, 



(3.22) 



This function is finally linearly independent from all other linear Boolean func- 
tions due to the term nj'=i ^j- Therefore all of the non-constant linear Boolean 
functions g{s) = ©j=i ajsj produce vectors g that are linearly independent. □ 



As a result of this lemma, the extreme points (—1)^*^®) for the linear Boolean 
functions g{s) = ©"^^ ajsj are afhnely independent. The dimension of V for 



(n, 2,2) is 2", and so (3.18) is facet-defining if this expression is satisfied for all 



of these linear Boolean functions. 



The key observation made by WW is that (3.18) is a discrete Fourier Transform 
and its inverse is 

Ps=^Y.(-^y''''(-^y^^^ (3.23) 

which is now a sum over all linear Boolean functions ^(s) = (©"^^^ ^j^j)- There- 
fore, for each facet Bell inequality we now have some choice of the variables 
7g(s) G {0, 1} for all functions g{s). There are then 2^" possible choices of these 
2" values of 7g(s)- We now express ( |3.18 ) in terms of correlators p(l|s) instead 
of expectation values IE(s), 



s < 



l-Es/3s 



G{0,1}. 



(3.24) 



The sum of coefficients /3s is equal to ±1 as it is equal to (— l)'''9{s) when 
g{s) = for all s. There are therefore 2^" facet Bell inequalities in the (n, 2,2) 



scenario of the form in (3.24). 



In the (n, 2, 2) scenario, we show that if we deal with expectation values we 
can derive all of the facet inequalities. As mentioned above, all of these inequal- 
ities can be obtained through substitution of the CHSH inequality in terms of 
expectation values |Werner2001j . The CHSH inequalities are expressed as a poly- 
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nomial in measurement operators on two sites, called "Bell polynomials". Every 
other inequality for n > 2 are multiples of these polynomials with measurement 
operators on other sites. This substitution of the CHSH inequality is clear in the 
expectation value scenario but not so clear in the correlator description. Despite 
this drawback, the insight we gain from Lemma [9] allows us to demonstrate that 
a particular inequality for each n is facet-defining as we now show. 

We have utilised a form of substitution in constructing tripartite CGLMP 
inequalities by having non-zero terms in the inequality when the input satisfies 
a particular constraint, e.g. si = S2- But not all inequalities in the (n, 2,2) 
scenario can be constructed from the CHSH inequality by this simple method. 
For example, the following facet Bell inequality in the (3, 2, 2) scenario as found 
by WW |Werner2001j . 

^ b(l|000) +p(l|001) +p(l|010) +p(l|011)] 
[p(l|100) +p(l|101) +p(l|110) - 3p(l|lll)] < 1 (3.25) 

has non-zero coefficients for all inputs s. However, we can generalise this inequal- 
ity to n parties utilising the result from Lemma 1 (in a slightly modified form). 
A generalisation of this inequality is 



2"-i 



'Vl|l) + ^p(l|s)j < 1. (3.26) 



It is worth noting that this inequality not only reduces to (3.25) for n = 3, but 
also the CHSH inequality for n = 2. 



We observe that the upper bound on the right-hand-side of (3.26) is saturated 



for all (2" — 1) linear Boolean functions g{s) = y^^^i (^jSjj ® 6 7^ where 
g{l) = 0. The upper bound is also saturated when g{s) = 1 for all s. These 
2" linear Boolean functions are also affinely independent by the argument of 
Lemmajo] For a particular linear Boolean function g^{s), only one out of the two 
functions g^{s) and g^{s) © 1 satisfy the condition that g{l) = 0. Therefore the 
set of (2" — 1) linear Boolean functions g{s) = (^^^^i cijSj^ with some of these 

functions having 1 added mod 2, will be the set g{s) = ^0"=]^ Oj^j) ® 6 7^ 
where g{l) = 0. By Lemma |9| the former set forms a linearly independent set of 
functions, and we can just add the constant function g{s) = 1 for all s to make 
an affinely independent set. 
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We have used an insight from the computational perspective of LHV correlators 

in the (n, 2, 2) scenario to define a facet Bell inequality for all n. Interestingly, 

for all correlators in V, the inequality in (3.26) is only maximally violated by 

Yl" s ©1 ^ 

the correlator p(l|s) = 6-^ ■'"^ ^ = 11^=1 ® ^ corresponding to the function 

/(s) = YYj=i ^- This is contrary to the Mermin inequality which is 

maximally violated by more than one correlator in V. 

So far in the discussion in this chapter, we have described facet Bell inequal- 
ities. They define the space of C They also guarantee that if a correlator is 
outside of £, it must violate one of these facet Bell inequalities. We have shown 
throughout that this violation can be achieved (uniquely or otherwise) maximally 
by particular vertices of V. Heuristically then, a violation of a Bell inequality 
can be associated with a computational advantage. The advantage being that 
non-LHV correlators can be associated with computations of non-n-partite lin- 
ear functions. This insight will be utilised in section [3^ where Bell inequalities 
may not be facet-defining, which can highlight the computational advantage of 
non-LHV theories. 

Of the possible theories that can be associated with non-LHV correlators, 
quantum theory is currently the only working theory. Whether the predictions 
of quantum theory in the form of a violation of a Bell inequality can be verified 
in a laboratory will be discussed in chapter |4j In the next section, we will discuss 
quantum correlators, or the space Q. We will explore methods used to find the 
maximal violations of Bell inequalities possible with quantum theory. This will 
give some indication of the extreme points of the space Q. 



3.2 Quantum Violations of Bell Inequalities 

We have described the structure of £ in terms of the facet Bell inequalities. We 
now give some indication of the structure of Q. By giving an indication, we 
mean that we find the maximal violation of the facet Bell inequalities. It is still 
an open question of defining the extreme points of Q in general. In the specific 
(n, 2, 2) scenario, WW have described the extreme points of the quantum region 
[Werner2001j . but otherwise, we can only numerically find particular extreme 
points. 

In this section, numerical methods |Navascues20081 IKaszlikowski2000] used to 
find the maximum quantum values of a Bell expression are reviewed. Using these 
methods we present numerical values for the bipartite facet Bell expressions we 
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found in subsection 3.1.3 In particular, we find the maximum quantum values 
for an expression in each orbit. Therefore, finding this value for an expression in 
an orbit also finds the quantum value for all expressions in that orbit. This is 
because the set of quantum correlators are also unaffected by the local operations 
on values m and s and permutations of parties. 

We also comment on the relationship between entanglement and violation of 
bipartite facet Bell inequalities. We show that the maximal quantum violation 
may not be achieved by a maximally entangled quantum state. Although a 



violation is a "witness" of entanglement (see section 1.4), more entanglement 
may not mean more non-classicality. 

We present the result of WW that all extreme points of Q have a closed 
form [Werner2001j . The maximum quantum value of all Bell expressions is an 
optimization over these points. What is more, these maximal expressions can 
be obtained from projective measurements on the n-party Greenberger-Horne- 
Zeihnger (GHZ) state |(mZ1989llWerner2nni] . The GHZ state can be considered 
as a natural, if ambiguous |Plenio2007] . multipartite generalization of the maxi- 
mally entangled state. 



3.2.1 Numerical Methods for finding Violations of Bell Inequalities 

In the literature, there are two main methods of finding violations of Bell inequal- 
ities. The first approach which we call the "multiport beam-splitter" or MBS 
approach |Kaszlikowski20d0l IDurt200T] . This method fixes the quantum state 
shared by both parties as the maximally entangled state 1^) = X]j=o^^ 
We then optimize over projective measurements made by each party to find a 
lower bound of the maximum quantum violation of a Bell inequality, if a violation 
occurs. 

A second, more general approach for finding a quantum violation of a Bell 
inequality involves semi-definite programming (SDP) |Boyd2004] . Therefore we 
call this approach the "SDP approach" as developed by Navascues, Pironio and 
Acin |Navascues20071 INavascues2008j . This approach involves constructing a 
positive semi-definite Gram matrix of (sequences of) correlations. The Bell ex- 
pression is then a linear function on elements of this matrix and we maximize this 
linear, or "objective" function. This second approach produces an upper bound 
on the violation of a Bell inequality. However, if the Gram matrix satisfies a 
certain property (called a rank loop) then the maximized objective is equal to 
the maximal violation of a Bell inequality |Navascues2007] . On the other hand, 
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if we do not satisfy this property if the lower bound produced by the MBS ap- 
proach is equal to the upper bound of the SDP approach then we have found the 
maximum quantum violation. 

Both of these approaches have been developed in the bipartite scenario but 



can be extended to the multipartite scenario |Navascues20071 IZukowskil999 
Naturally though, with an increasing number of parties, the optimization for 
both approaches becomes harder for a desktop PC. In this subsection, we only 
use these two methods for finding bipartite quantum violations, so we will only 
describe them in these two scenarios. We now proceed to describe each approach 
in more detail. 

The MBS approach is described as follows |Kaszlikowski2nnni iDurtMIT] . The 
quantum state shared by two parties is first fixed as the d^-dimensional maximally 
entangled state \"^) = -^Yl'j=o both parties attain measurement out- 

comes associated with projectors (/Uj = Vsj\f^) {klVsj , where {|/c)|A; G I^d} 
is the standard basis of H^. The Vsj is a unitary matrix and can be written 
as Vgj = FDg. where F is the d-hy-d Quantum Fourier Transform matrix with 
elements for the jth row and fcth column Fj^k = • '^^^ d-hy-d ma- 

trix Dg^ is a diagonal matrix Dsj = diag(e*'^i*^'^j), e*'^^^*^^ e*'^'*^''^^) with (t>j{sj) 
as real phases. Therefore we optimise over these phases (f>jisj) to numerically 
maximize the quantum violation for the maximally entangled state. 

This first approach can be modified further by altering the quantum state after 
optimization of the phases (f)j{sj), as indicated by Acin et al |Acm200"2] . We first 
obtain the optimal angles 4>j{sj) found for the maximally entangled quantum 
state. We then substitute these optimal angles into the projectors Vsj\k){k\Vsy 
Then we construct the Bell expression in terms of these optimal projectors giving 

s \s,m / 

= (V'lWlV^) (3.27) 

where \ip) is not necessarily the maximally entangled state |^'). Finding the 
largest possible quantum value of the Bell expression is then a case of finding the 
largest eigenvalue of W. Acin et al used this method to find a larger quantum 
violation of the CGLMP inequality for (2,2,3) with a non-maximally entangled 
state |Acm200"2] . We will discuss the connection between entanglement and Bell 



inequality violation in subsection 3.2.2 
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We now briefly present the SDP approach. Central to the SDP approach is the 
construction of a positive semi-definite Gram matrix T. The elements Fj^ of this 
matrix are Tjk = (V'lOjO/d^) where Oj is a linear combination of products of 
projectors Emj,sj that depend on rrij and Sj at each jth site. These projectors cor- 
respond directly to probabilities of getting mj given Sj, i.e. p{mj\sj) = {Emj,Sj)- 
The projectors act on an arbitrary dimension Hilbert space which is shared by 
aU parties. They also satisfy (^lE'm^^sJV) > for all states l'^), -Emj.sj = -^mj.sj 
(Hermiticity), Em^^sjE^'^^s, = (orthogonahty) and J2mj ^rnj,sj = I- 

We associate the degree of this product (i.e. the number of terms in the 
product of projectors) with a set of quantum operators, i.e. for degree of products 
being i' we have the set Qu- For example, the set Qi can be associated with 
the identity matrix I and Oj consisting solely of linear combinations of single 

projectors Em s - Q2 is the set of 2-term products Em s-Em' s' for s'- 7^ Sj 

J' J J ' J j J J 

and Emj,SjEm'.,s -I foi' J 7^ f ■ Another set of interest is intermediate sel|^ 

between Qi and Q2, where we have all the operators which are the pairwise 
product of projectors between parties j and j' where j' 7^ j. 

The set Qoo of all products of projectors is then the set of all values of 
(V'lOjOfcl'i/') possible with quantum mechanics. However, it is possible that F^,, 
the Gram matrix of operators associated with Q^, may already contain all values 
(^/'lOjOfclV') ill Qoo- If this occurs then the rank of F^, is equal to the rank of 
Fiy_i, resulting in a "rank loop". For more detail see |Navascues2008| . 

To find the quantum upper bound for a Bell expression we perform the follow- 
ing semi- definite program: 

maximize tr(i?"'"F) 
subject to F > 

tr(FTF) = 0, jG{0,l,...,x}, (3.28) 

where B \s a. matrix of the coefficients of the Bell expression for each probability 
p(m|s) = (YYj=iEmj,Sj) ■ The matrices Fj are x linear constraints on elements 
ofF. Of course, Fqq will be infinitely large, so if we restrict at first to F^, we 
obtain an upper bound on ii{B^T) for all quantum probabilities. It is an upper 
bound as there are fewer constraints on the elements of F, and so F might not be 
compatible with quantum physics. The bound can then be subsequently lowered 

^In |Navascues2008| . this set is written as Qi+ab where A and B represent two parties, and 
the set includes pairwise products of the projectors for each party. 
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if we consider matrices (corresponding to the set Q2) which will impose more 
constraints on the products of projectors compatible with quantum physics. 

The value of tr(i?"'"r) will be the true quantum value (up to numerical error) 
if we have a rank loop as described above. Semi-definite programming forms 
part of the subject of convex optimization |Boyd2004] . There are algorithms for 
dealing with semi-definite programming such as those in the packages of YALMIP 
[Yalmipl and SeDuMi [SeDuMi] . We utilise these numerical methods to find 
quantum bounds of Bell expressions, and also to look for a rank loop. However, 
we do not need to look for a rank loop if the value of tr(i?^r) is equal to the 
lower bound of the MBS method within numerical error. These two methods 
then give us an indication of the extreme points of Q. 

In the construction of the SDP approach we did not explicitly say that n = 2. 
Indeed this method can be utilised in the multipartite case but in order to have 
the correlations of n parties in F, one needs to go to at least The MBS 

approach can also be generalised to the multipartite scenario but again the prob- 
lem becomes more complicated. In the following subsection we will utilise both 
the MBS and SDP approaches to find the maximal quantum violations of all 
bipartite facet Bell inequalities. Therefore, consideration of multipartite gener- 
alisations will not be relevant for our discussion. 



3.2.2 Bipartite Quantum Violations and Entanglement 

We now describe the maximal quantum violations of facet Bell inequalities for 
n = 2. We used both methods described in the previous subsection first finding 
a lower bound using the MBS approach and then the SDP approach to confirm 
that this is the maximal value. We list all of the maximal quantum violations 



for n = 2 facet Bell inequalities numerically in Table 3.4 The numerical error in 
these values is of the order of ±10"^ and maximal violations resulting from both 



the MBS and SDP approaches agree within this error. Also in Table 3.4 we have 
indicated which maximal violations result from the maximally entangled state 
1^') = Sj=o^^ Ijj)- ^'^^ d ^ 2, there are instances where maximal violation is 
not a result of maximal entanglement. 

While the construction of Bell inequalities was initially partly motivated by the 
issue of entanglement, the connection between entanglement and violation is not 
completely clear. A violation of a Bell inequality indicates that measurements 
are made on an entangled state, but entanglement does not necessarily result in 
a violation of a particular inequality [Werner 1989| . For the CHSH inequality, the 
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n 


c 


d 


Orbit 


LHV bound 


Quantum bound 


Entanglement 


2 


2 


2 




2 


2.41421' 


1.000 


2 


2 


3 


CcGLMP 


3 


3.9149 


1.555 


2 


2 


4 


Cc'GLMP 


4 


5.4594 


1.938 


2 


2 


4 


*-d=4 


2 


2.4142 


1.000 


2 


2 


4 




4 


4.8284t 


2.000 


2 


2 


5 




5 


6.3145 


2.310 


2 


2 


5 




5 


7.6290 


2.310 


2 


2 


5 




5 


7.0314 


2.230 


2 


2 


5 


CcGLMP 


5 


7.0314 


2.230 


2 


3 


2 




2 


2.4142^ 


1.000 


2 


4 


2 


Bi to ^5 


8 


9.7570^ 


1.000 


2 


4 


2 


^6 to Bii 


4 


5.0825t 


1.000 


2 


4 


2 


Cc=4 to Cc=4 


2 


2.4142t 


1.000 



Table 3.4: We list the bipartite maximal quantum violations for particular facet 
Bell inequalities for c and d. We have grouped the orbits of inequal- 
ities Bi to (increasing numerically in the label of the inequality 
as they have the same LHV and quantum upper bounds. The same 
grouping also applies for inequalities Bq to Bu. Those violations that 
are achieved with the bipartite maximally entangled state of (P dimen- 
sion are labelled with a f. We also present the numerical calculation of 
entropy of entanglement for the pure state associated with each max- 
imal violation. Recall that the maximally entangled state will have 
entanglement log2(c/). 
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maximal violation allowed by quantum mechanics is produced by the maximally 



entangled state |Tsirelsonl980| . As we can see from Table 3.4 this is not true 
in general. Also in Table 3.4, we have calculated the entanglement of the pure 
state that maximally violates each inequality. The entanglement of bipartite pure 
states £ €5 'H'^ in d^-dimensional Hilbert space is calculated from the 



entropy of entanglement |Plenio2007j . Interestingly from Table 3.4 



the entanglement of the state that maximally violates the CGLMP inequality 
decreases with d. 

It has been established previously that a violation of a Bell inequality and 
entanglement are two different, but related issues |Vidick201H |Liang2011| . For 
example, statistics that violate a Bell inequality can be seen as a "resource" for 
demonstrating non-classicality, and entanglement can also be seen as a resource 



(see section 1.3). It has been shown that these two resources are different if one 
wants to use one resource to simulate the statistics of the other [Brunner2005j . 

In subsection 3.2.1 we mentioned that if one attains a rank loop between a 
Gram matrix and another Gram matrix Ty^i, then the quantum value of Bell 
expression has reached its maximum value for Fjy. For all of the examples in 
Table 3.4, there was a rank loop found between Fg and Fi. This observation is 
confirmed for the CGLMP inequalities by results obtained by Navascues, Pironio 
and Acm |Navascues2008| . This leads us to conjecture that the maximal quantum 
value resulting from Q for all bipartite Bell expressions for correlators is obtained 
from correlations in the set 

In this subsection we have indicated that all of the bipartite facet Bell inequal- 
ities found in this chapter are violated by quantum correlators. However, the 
maximum possible violation is not achieved by the relevant maximally entangled 
state. This implies it might not be favourable to use a maximally entangled state 
for the largest violation. This behaviour has also been observed when considering 
Bell inequalities expressed in terms of elements of the full probability distribu- 
tion p(m|s) |Vidick201H |Liang20TT] . In the next subsection, we describe the 
quantum region Q for the (n, 2, 2) scenario. The connection between maximal 
violation and quantum state is also far clearer for all n; it results from the GHZ 
state |GHZ1989] . The GHZ state for n = 2 case is the maximally entangled state 
for d = 2. 
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3.2.3 Quantum Upper Bounds of {n,2,2) Bell Inequalities 

We now consider the maximal quantum violation of any Bell inequality in the 
(n, 2, 2) scenario. We state the following result (as obtained by WW |Werner2001j ) 
in terms of the maximal quantum value of a Bell expression. 

Theorem 10. The maximal quantum value of a Bell expression for the (n, 2, 2) 

scenario is 



sup 



E 



+ 



2"" 



(3.29) 



where 6j are n angles, or real parameters. These maximal quantum values result 
from von Neumann measurements on the GHZ state: 



\GHZ) 



1 

71 



(gin 



+ II)®'') . 



(3.30) 



Proof: We map from the correlators p(l|s) to the expectation values IE(s) for 
measurements, or observables having outcomes ±1. For quantum correlators, 
the measurements are Hermitian operators Mgj = Q^. — Ql. where QT/ are the 
projectors corresponding to outcome ruj. Therefore, —I < Ms - < I and M^. = I 
where I is the identity matrix. The expectation value is then for all pure states 
\ip), E(s) = {ip\ (S)j=i ^Sjli^), which can be substituted into a Bell expression to 
achieve the maximal quantum value 



sup 



(3.31) 



It remains then to minimize the expression /3s (V' I ^^=i ^sj\'4') over all states 
and choice of measurements. This equates to finding the minimum eigenvalue of 
the operator J2s l^s 'S)^=i , or the operator norm || ... || of — /3s (S)j=i ■ 
To find this operator norm, we need to diagonalize the operator and we can do 
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this in the following way since the identity commutes with all operators: 



sup 



M, 



i=i 



^o|||l5^/3s(g)(MoMi 

j=l s j=l 

n 

s j=l 



sup 



(3.32) 



We obtain this sum of complex terms as MqMi = Uj is a unitary matrix as 
(MoMi)t = MiMq and {MoMi)^MoMi = I. The last line is then the norm of a 
linear combination of unitary matrices. 



We now show that the value of (3.32) is attained by observables Mg. on the 



GHZ state |GHZ) = ^ (|0)®" + We prove this by construction where 

each party's measurement is 



(3.33) 



With these measurements, we obtain the following expectation values 



Ps{GHZ\ (g) Ms,\GHZ) = - ^ /3, cos n(/> + s.Oj ) . (3.34) 

s j=l s \ j=l 



We can write the expression over which we take the supremum in (3.32) as 

n 

J2 (3,e'^^U ^.^.) = ^ /3,Re U^+^U ^.^.)) = J] cos (^ + s.O,). 



i=i 



(3.35) 

We choose (/> = and so the optimal values of 9j in (3.32) can be substituted 



into the measurement in (3.33). Therefore, these measurements on a GHZ state 



attain the maximum quantum upper bound of a Bell expression. □ 



A corollary of this theorem is that since the quantum correlators 

n 

p{l\s) = 1 - 2cos(V' + Ysj9j) 



(3.36) 



92 



can be optimized to maximally violate a Bell inequality, these correlators are 
extreme points of Q for all V'? ^j- The space Q must contain every one of these 
extreme points, and so is the convex hull of these correlators |Werner2001j . 

We now illustrate how the above theorem can be used to find the maxi- 
mal quantum violation for the CHSH and Mermin inequality respectively. The 
phase values {V'j^jli ^ {1; •••;"■}} for the CHSH inequality are = —\ and 
^1 = ^2 = f • Substituting this into ( |3.32 ), we obtain Tsirelson's bound 1 + \f2 
|Tsirelsonl980] . For the Mermin inequality ( fslsl ), = and 6*1 = 02 = -6*3 = - f 
we have the maximal quantum (and algebraic) upper bound of 3. The quantum 
violations for facet Bell inequalities in the (3, 2, 2) and (4, 2, 2) cases are listed in 
|Werner2nnT] . 

In this section, we focussed on the quantum violation of facet Bell inequalities 
in various (n, c, d) scenarios. However, all of the methods described so far apply 
to any Bell inequality, facet-defining or otherwise. We have used the facet Bell 
inequalities to show that in all of the scenarios investigated, Q is strictly larger 
than C The facet Bell inequalities are associated with their own difficulty; we 
have only shown that Q is larger than L for a small number of scenarios where we 
could actually find the facet Bell inequalities. On the other hand, if we suspend 
the necessity for the facet-defining condition and demonstrate a violation of an 
arbitrary Bell inequality, then Q is still strictly larger than £. In the next section, 
we will consider Bell inequalities that are not facet-defining and show that they 
are of importance for considering quantum correlations. These inequalities are 
also of relevance when considering information processing tasks. 



3.3 Non-trivial Bell Inequalities 

Bell inequalities were first constructed in order to show that the statistics re- 
sulting from LHV theories |Belll964l ICHSH1969j are constrained; this constraint 
does then not apply to quantum theory. The facet Bell inequalities go further and 
not only constrain LHV statistics but also define the space of LHV correlators. 
We have indicated that to find these region-defining inequalities is a difficult task. 
However, if we just want to find Bell inequalities that distinguish between LHV 
and non-LHV correlators, satisfying the facet-defining condition is not necessary. 
We say that Bell inequalities are "non-trivial" if there are correlators in V that 
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violate it, i.e. 



(d-i) 

5Z l^kM^l^) <lc< IV, 



(3.37) 



s k=l 



with fSk^s as real pre-factors and as the upper bound resulting from all corre- 
lators in £; 7p is the upper bound of the inequality for all possible correlators 
in V. As indicated above, for a non-trivial Bell inequality, there is the strict 
separation jc < Iv- 

We describe an explicit set of Bell inequalities that are non-trivial. We then em- 
ploy a connection between these inequalities and an information processing task 
called a "non-local game" |Cleve2004] to derive an infinite number of non-trivial 
Bell inequalities. We begin our discussion in the simplest scenario by discussing 
the CHSH inequality and utilise its "computational nature" |vanDam2000j . We 
show that the intuition of the CHSH inequality as measuring the ability to per- 
form a non-linear Boolean function with classical correlations can be applied to 
all scenarios. Again, central to our discussion is the computational perspective 
of LHV correlators. We utilise the limited computational expressiveness of LHV 
theories to derive consequences of this limitation. 

3.3.1 Non-trivial Inequalities as Generalisations of the CHSH 
Inequality 

When the CHSH inequality jCHSH1969| was originally derived, the characterisa- 
tion of correlations in terms of convex polytopes had not yet been considered. It 
may be considered a happy coincidence that this inequality is facet-defining for 
the LHV polytope. Despite being placed in the context of convex polytopes, the 
CHSH inequality has been redefined in the context of non-local games |Cleve2004] 
as we shall discuss in the next subsection. Such a versatile inequality also has 
a computational perspective that helps understand why it puts a restriction on 
LHV correlators |vanDam2000j . We will exploit this perspective to derive a gen- 
eralisation of the CHSH inequality for all (n, c, d) scenarios. 

In order to describe this computational perspective we again write out the 
CHSH inequality 



p(l|00) +p{l\Ql) +p(l|10) -p(l|ll) < 2, 



(3.38) 
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and make the substitution p(l|ll) = 1 — p(0|ll), to obtain 

p(l|00) +p(l|01) +p(l|10) +p(0|ll) = 
1 

EE'^'i-2eiP(^|s) < 3. (3.39) 



s k=0 



LHV correlators p{k\s) are contained in the convex hull of linear Boolean func- 
tions g{s) on s. So, p{k\s) = Y.g{s)P9{s)^g{s) "^'^^^ Pais) ^ and Eg{s)Pg{s) = 1- 
Then, by convexity, the following expression must be satisfied for all linear 
Boolean functions g{s) in the (2,2,2) scenario: 



s k=0 



ECiei < 3. (3.40) 



By listing all possible functions g{s) and seeing when they overlap with siS2 ® 1, 
we see that the maximum overlap is 3. We can then rewrite the original CHSH 
inequality in terms of correlators f (l|s) and this derivation of the LHV upper 
bound 7£ 



p(l|00) + p{l\01) + p{l\W) -p(l|ll) < max 



1. (3.41) 



Essentially, this inequality "measures" the inability for LHV correlators to achieve 
the non-n-partite linear function siS2®l deterministically |vanDam2000j . If LHV 
theories could achieve this function deterministically then 7£ = 3 as ^^/{J) ~ ^' 
This is, however, not possible and this is the upper bound 7-p for all correlators 
in "P so 7-p > 7£. 

The CHSH inequality is not the only example of a well-studied Bell inequality 
that can be written in terms of the overlap between a non-n-partite linear and n- 
partite linear function. The Svetlichny inequality |Svetlichnyl987| as mentioned 
in chapter [2j section "lA 



p(l|000) +p{l\001) -Fp(l|010) -p(l|011) 
-Fp(l|100) -p(l|101) -p(l|110) -p(l|lll) <2, (3.42) 
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can be rewritten as 



eiP(/c|s) < 6, (3.43) 

s fc=0 

after making the substitution of p(l|s) = 1 — p(0|s) when the prefactors in 



(3.42) are —1; this is the case when siS2 ® siss ® S2S3 ©1 = 0. Again, by 
convexity the upper bound of this inequahty just results in the maximum over- 
lap Zls '^f!s2esiS3es2S3ei hnear Boolean functions ^(s) for (3,2,2). The 
function /(s) = S1S2 ® S1S3 ® S2S3 © 1 is a non-linear Boolean function and so 
the overlap ^^/{s) definition will always be lower than 2" = 8. Again, we 
can rewrite the above Svetlichny inequality as 

p{i\ooo) + p{i\ooi) + p{i\ow) - p{i\on) + p{i\wo) - p{i\wi) 

-p(l|110) -p(l|lll) 



< max 

9(s) 



S2©S1S3©S2S3©1 



2. (3.44) 



For all possible correlators in V, the upper bound is then 4 thus it is a non- 
trivial Bell inequality, as expected. However, it is not a facet Bell inequality for 
the region C, but facet-defining for the Svetlichny region, S. The region 5 is a 
sub-region of V but larger than C, therefore bounds the region C Non-trivial 
Bell inequalities can then provide a useful tool to bound C away from the whole 
space V. 

The CHSH and Svetlichny inequalities above notably utilise the fact that linear 
Boolean functions cannot be equal to non-linear Boolean functions for all inputs 
s. Given that LHV correlators are associated with the former and not the latter, 
we can write down inequalities of the following form for all scenarios (n, c, d): 

s A:=0 ^ ' s 

for all non-n-partite linear functions /(s) and n-partite linear functions ^(s). 



The above inequality in (3.45) is defined for all correlators p{k\s) and not the 
normalised set of correlators for k G {1,2, (d — 1)}. Therefore in order to 
describe this inequality in terms of normalised correlators, i.e. vectors in V, 
we impose the normalisation condition that 1 — Ylk=i^ P(^\^) ~ Pi^\^)- The 
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expression on the left-hand-side of (3.45) becomes 



E 



1 



(d-i) 



(d-i) 



fc=l 



fc=l 



< max S'^i^} , 
- g(s) ^ 5(s) 



(3.46) 



which can be rewritten in a form similar to the CHSH inequality, 

"(d-i) 



E 



E * 



k=\ 



7(s) ''/(s) 



P(^|s) 



< max^^ ((^ 



^0 



The upper bound 7£ then is strictly smaller than 7-p = c" 



(3.47) 



so the 



Es'5/(s)' 

inequality is non-trivial. As we have already demonstrated, the CHSH inequality 
and Svetlichny inequality are examples of these non-trivial inequalities. As with 
the Svetlichny inequality, they are not necessarily facet inequalities for but 
necessarily bound the region £. A non-trivial Bell inequality must also intersect 



£ at, at least, one vertex otherwise the right-hand-side of the inequality in (3.47) 
is not tight. 

Not only are these inequalities interesting because of their ability to bound 
£, but they have a role in information processing tasks. The particular task of 
relevance is a non-local game |Cleve2004] . One can be successful at such a game 
if they violate a Bell inequality, hence the use of "non-local", as in non-LHV 
resources. One wants to achieve some task (expressed as a game) with as great 
a probability as possible. Games in general are of interest in computer science 
and in fields of applied mathematics such as economics |vonNeumannl944] . In 
some non-local games such as the "XOR games" [Cleve2004j, the BeU inequahty 
can quantify the probability of achieving a task and so have a natural role in 
these games. We use the language and structure of non-local game to describe 
an infinite number of non-trivial Bell inequalities for each scenario {n,c,d). 



3.3.2 Non-local Games 

We have discussed the operational perspective of Bell tests where we have many 
parties each with inputs and outputs. Many information processing tasks can 
be abstracted to a process with an input, and a transformation of the input to 
produce an output. We now focus on one particular task that has a natural 
connection to Bell tests, the non-local game (NLG) |Cleve2004] . In this section 
we discuss the set-up of an NLG and how it is relevant to the discussion of 
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constructing non-trivial Bell inequalities. In the next section, NLG will again be 
discussed and made relevant to the subject of MBQC. Therefore, these games 
are of relevance to a great deal of discussion to both Bell tests and this thesis in 
particular. They also give an interesting computational perspective on Bell tests 
that has been of interest to the quantum information science community. 

We now describe a particular NLG with n parties, or "players" as they are 
often called. These n parties do not communicate with each other, and so for all 
intents and purposes, are space-like separated as in Bell tests. As well as these n 
parties, there is another party that is not a player, but a "referee". A referee can 
be seen as the experimenter in a Bell test who calculates the correlators p{k\s). 
However, one distinct aspect in NLG from Bell tests is in the role of the referee 
as the person who distributes inputs to the n parties as well as retrieving their 
outputs. In the format of Bell tests that we have discussed so far, the inputs at 
each site are generated randomly by the parties themselves, in NLG this is not 
the case. To summarise, n parties each receive an input from the referee and then 
generate an output which they send to the referee. The referee finally calculates 
some function on the outputs; the objective of these parties is to maximize the 
mean probability (for all inputs) of this function being equal to some desired 
value. We now specify the particular NLG that is of relevance to our discussion: 

1. A referee sends the input digit-string s = to the n non-communicating 
parties. The inputs s are sent with probability distribution 7r(s) such that 

2. All n parties generate an output digit-string m = Z[J which is sent to the 
referee; 

3. The referee calculates the sum modulo d of all outcomes Ej=i "^ild = ^5 

4. The goal of the game is for the players to maximise the average success 
probability of "^"^^i rrij = /(s) for some function f -.121 ^ Z^^. 

Examples of these games include the well-studied multi-party "XOR games" 
where c = d = 2 |Cleve2004] . The average success probability Vf{s) of achieving 
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En 
7 = 1 



rrii 



/(s) can be written in terms of the correlators p{k\s): 



(d-l) 



p{k\s). 



i3X 



k=0 



In order to maximize this average success probabiUty, we want to find the opti- 
mal correlators p{k\s). We can then distinguish between the maximum average 
success probability P^^g) and P^(^^^ resulting from quantum and classical (or LHV) 
correlators respectively. If Pj(s) ^ Pf{s) ' optimal to use quantum resources 



instead of classical resources. Also, (3.48) produces a Bell inequality if the corre- 



lators result from LHV theories which is upper bounded by Pj^g)' 



^2 



we have a violation of this Bell inequality. One way that we can possibly have a 
separation P^^g) < P/(s)' function /(s) is a non-n-partite linear function. 



If /(s) is an re-partite linear function, then 



1. 



for 



To make the connection to Bell inequalities explicit, if 7r(s) = vr(s') = 
all s 7^ s', then we obtain a modification of the non-trivial Bell inequalities (3.45) 



discussed in the previous subsection. We can rewrite the inequality in (|3.47|) in 



terms of p^(^) for /(s) being a non-re-partite linear function: 



(d-l) 

E 

k=l 



p{k\s) 



< 





/(s)' 



(3.49) 



where 



Pm 



max / w , . 



(3.50) 



For (3.48), if the correlators are all possible correlators in V, then the maximum 
average success probability is Pj(g) = X]s'^(^) ~ 1- Therefore, the fact that 



Pf{s) ^ Pf{s) iiidicates that the inequality (3.49) is non-trivial. 

In order to establish a non-trivial Bell inequality, we need to find a probability 
distribution 7r(s) such that Pj(g) < P/(s)- following result, we describe an 

infinite number of simple probability distributions such that we can generate a 
non-trivial Bell inequality. 



Proposition 11. All inequalities of the form 



(d-i) 
E 

k=l 



^m 



^m 



p{k\s) 



< 



Pm 



7r(s)5° 



(3.51) 
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are non-trivial Bell inequalities for all non-zero probabilities 7r(s) if /(s) is a 
non-n-partite linear function. 



Proof: In order to prove this we just need to show that Pji-g) < P/(s) ~ ^ 
the probabihty distribution 7r(s) being non-zero for all values of s. That is, 



maxj]^(s)5^^g<l, (3.52) 



9(s) ^ ' ■ 



which is true as < 7r(s) < 1 for each s by definition and "gls) < 1- 1^ 

For a distribution 7r(s) satisfying < 7r(s) < 1 for each s and X^g^C^) ~ 
we can construct non-trivial Bell inequalities: the number of 

non-n-partite linear functions /(s). We are able to construct an infinite number 
of non-trivial Bell inequalities parametrized by 7r(s) utilizing a computational 
perspective on Bell tests. Crucially though, the non-trivial Bell inequalities in 



(3.52) are not dependent on an NLG construction, they exist outside of NLG. 
More specifically, the probability distribution 7r(s) is just a positive, non-zero 
weighting on correlators for a particular s. In the context of NLG, the inequalities 



in (3.52) are related to the success probability of the game, but outside of this 
context we still have an infinite number of non-trivial Bell inequalities. 

So far our discussion has applied to all possible scenarios (n, c, d) and we have 
stated general results for all these scenarios. In the discussion in the next subsec- 
tion we will focus on a particular example of non-trivial Bell inequality in (n, 2, 2) 



for all n of the form in (3.47). This example is a generalisation of the CHSH 
inequality to n parties. In contrast to the CHSH inequality, we will show that 
the upper bound of the quantum correlators is no better than the LHV upper 
bound. 



3.3.3 (n,2,2) scenario and the n-partite NAND function 

It is natural at this point to question the motivation for finding non-trivial Bell 
inequalities when facet Bell inequalities are more useful for determining the con- 
sequences of LHV correlators. As well as the motivations from computational 
complexity that finding facet Bell inequalities is hard, the utility of bounding C 
and connection to information processing tasks, we have further motivation in 
the (n, 2, 2) scenario. We show that in this scenario, facet Bell inequalities can 
be related to the non-trivial Bell inequalities described earlier. 
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As well as this general discussion, we will give an example of a non-trivial 



Bell inequality of the form (3.47) for the (n,2,2) scenario. This non-trivial Bell 
inequality for all n is a direct generalisation of the CHSH inequality. Essentially 
it is associated with a non-n-partite linear function, itself a generalisation of the 
function S1S2 ® 1 for n parties. We will use this function also to say something 



about MBQC in the section 3.4 We will show that for more than 2 parties this 
non-trivial inequality cannot be violated by quantum correlators. 



Firstly, we rewrite the non-trivial Bell inequalities in (3.49) in the specific 
(n, 2, 2) scenario as 



.l)/(s)+yi|s) <maxV^(s) 



(3.53) 



but now we allow probabilities < 7r(s) < 1. If we allow probabilities 7r(s) G {0, 1} 
then we might not have non-trivial Bell inequalities as we can choose probabilities 
such that 7r(s) = when /(s) 7^ g{s) and non-zero otherwise. In this instance, 
TT{s)6g^^^ = 1, hence jc = Iv and we do not have a non-trivial Bell inequality. 
However, for a choice of function /(s) and probability distribution 7r(s) we can 
construct a facet Bell inequality. Since all prefactors /3s of a facet Bell inequality 
are real, then they can be rewritten as /3s = | /3s | sign (/3s). We now fix values as 
7r(s) = ^^i'^ I and sign(/3s) = (—1)-^'^®)+^, then we multiply both sides of ( |3.53[ ) 



with |/3s| to obtain the inequality of the form in (3.1). If /(s) is an n-partite 



linear function in (3.53), then we cannot define a non-trivial Bell inequality and 



so cannot define a non-trivial, facet Bell inequality. Finding the facet Bell in- 
equalities for (n,2,2) is then a case of finding a probability distribution 7r(s) 
where we satisfy the facet-defining condition. 



For example, the inequality (3.25): 



- [p(l|000) -Fp(l|001) -hp(l|010) -Fp(l|011) 
+^ b(l|100) -hij(l|101) -Fp(l|110) - 3p(l|lll) 



< 1, 



(3.54) 



can be rewritten in the form of (3.53) with 7r(s) 
3 



for all s / {1, 1, 1} and 



vr s 



10 



for s = {1,1,1} and /(s) = S1S2S3 ® 1. With these substitutions 



(3.54) can be retrieved from (3.53) but now both sides of the inequality in (3.54) 
are multiplied by j^. Now, max^g7r(s) \ — <^/(s) ' = to so that 



10 



10- 



The latter result can be seen from the fact that the n-partite linear function 
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g{s) = 1 overlaps with /(s) for 7 values of s; these are the inputs s ^ 1. We have 
briefly shown that at least one of the non-trivial Bell inequalities as described in 



Proposition 12 is also a facet Bell inequality. 



We have shown that for a function /(s) we can find a probability distribution 



7r(s) where the resulting non-trivial Bell inequality in (3.53) is a facet Bell in- 
equality. We now take a different approach and fix the probability distribution 
to be 7r(s) = for all inputs s. This can be seen as the probability distribution 
of the Bell test being an NLG with inputs chosen randomly. 

Given this probability distribution, we consider a function /(s) for all n in 
{n, 2, 2). This function is a natural generalisation of the function /(s) = siS2 © 1 
corresponding to the function defining the CHSH inequality. This function will 
be discussed later with reference to quantum computing and so we define it now. 

Definition 6. The n-partite NAND function is /2(s) = YVj=i ® ^ acting 
on bit-string s G Zg. 

A NAND function is defined on two bits si and S2 as /(s) = si,S2 ® 1. This is 
exactly the function that we used when describing the CHSH inequality earlier in 
this chapter. The NAND function is the negation (or NOT) of the AND function 
/(s) = S1S2 |Papadimitrioul994| . The n-partite NAND function consists of the 
entire NOT of a number of AND functions between variables in s. What is 
clear is that it is a non-linear Boolean function due to the multiplication between 



elements of s. It is also the function describing the facet Bell inequality (3.54) 
above. 

For the n-partite NAND function and uniform probability distribution we ob- 
tain a non-trivial Bell inequality of the form 



on ^ 



^^(^)+V(l|s) < (3.55) 



The upper bound on the right-hand-side is due to the fact that ^g{s)^ ~ (2"— 1) 
if g{s) = 1, and all linear Boolean functions g{s) are never always equal to /2(s). 
We have shown that this inequality for n = 3 is related to the facet Bell inequality 



(3.54), and this relation extends to all n in (3.26). We now show that this natural 
generalisation of the CHSH inequality has no quantum violation whatsoever for 
n > 3. 



Proposition 12. The non-trivial Bell inequality (3.55) for the n-partite NAND 



function for uniform probability distribution 7r(s) = 2 " for all s is not violated 
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by quantum correlators for n > 3. 



Proof: The quantum upper bound for the inequaUty (3.55) can be calculated 



from (3.32) to obtain: 



2^(-l)-^2(s)+y;L|s) = sup 



^(_l)/2(s)+ie»{E"=i^. 



2" - 2 + sup 



^(_l)/2(s)+ie*{E; 



(3.56) 



If there is a quantum violation of the inequality in (3.55) then the following 
relationship must be satisfied: 



sup 



> 2" - 2 



(3.57) 



We may, without loss of generality, restrict 9j to the range 9j G (— 7r,7r). We 
simplify inequality (3.57), using the fact that (— l).^2(s)+i _ ^ £qj. bit strings s 



except when s = 1, to write 



sup 



s k=l 



sup 



2-n 



cos I ^ 1 - 2e'^kSk 



> 2" - 2. 



(3.58) 



We now adopt a geometric argument. The goal is to maximize the modulus of a 
sum of two complex numbers. These numbers may be represented, on the plane, 
as two sides of a triangle. The first side has length 2" 0^=1 cos( y)) the second is 
of length 2 and the angle between these sides is J^k complete the proof 

by showing that when n > 7, the length of the third side of the triangle can never 



exceed 2" — 2, and hence (3.58) is never satisfied. 



We proceed by assuming the opposite of what we want to prove and demon- 
strating a contradiction. Via the triangle inequality, for this inequality to be 
satisfied, the length of the base of the triangle must be greater than 2" — 4, and 
thus 



n 



cos I y I > 1 - 2 



2-n 



(3.59) 



Since 9j G (— tt, tt) all terms in the product are non-negative, hence we can impose 
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the weaker condition for (|3.59|) that V0j, cos ( ^ ) > 1 — 2^ This imphes that 



2~n\ 



I^j=i ^1 < narccos(l — 2 
Proceeding geometricahy, we now use the cosine rule to express the third side 



of the triangle (representing the modulus in (3.57)), and this expression must 
satisfy 



4 + 22"[]cos' 



2 / 1^ 

2 



2"+2cos 




cos 



> 4 + 2^n-2 



n+2 



(3.60) 



to obtain a quantum violation. Since 9j G (— 7r,7r), nr=i^os(§") non-negative, 
and hence a violation can only be achieved if cos(^^^]^ ^) is negative which 
implies | Yl^=i %l > f • Using this, we achieve 



narccos(l - 2^"") > 7r/2 



(3.61) 



or equivalently 



cos 



2n 



> (1 



i2-n\ 



(3.62) 



This inequality is only satisfied for integers n < 7, hence, due to the contradiction 
with our initial assumption, for n > 7 the quantum and classical bounds of the 



non-trivial Bell inequality (3.55). Direct numerical verification of the bounds. 



via equation (3.32) for n < 7 indicates that the bounds coincide for all integer 



values 3 < n < 7, thus completing the proof. □ 



This proof demonstrates that, Q is smaller than V for this scenario. It also 
demonstrates that quantum correlators are not always useful in every non-local 
game. If we modify the probability distribution 7r(s) by weighting the input 
s = 1 more than other inputs, we can regain a quantum advantage as with the 



inequality in (3.26). 



This example of a non-trivial Bell inequality not being violated by quantum 
mechanics is not isolated. For example, the following non-trivial Bell inequal- 
ity for (2,3,3) corresponding to the function /(s) 
probability distribution 7r(s) = ^: 



[sfs2 + with uniform 



(p(l|00) +p(l|01) +p(l|02) +p(110) +p(2|ll)) 
+ ^ (p(2|12) +p(l|20) +p(2|21) +K2|22)) < § 



(3.63) 
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is also not violated by quantum correlators Q. This upper bound of | was found 
using both the MBS and SDP approach. Just like the n-partite NAND function, 
the function /(s) = \s\s\ + ijg differs from all possible n-partite linear functions 
for only one value of s. 

We have shown that quantum resources are not always better than classical re- 
sources when trying to maximize the mean probability of winning an NLG. This 
is not new as Linden et al jLinden2007j devised a model of "non-local compu- 
tation" where quantum resources do no better than classical, or LHV resources. 
The resulting Bell inequality defining over probabilities p(m|s) from this model 
is also not facet-defining. Perhaps more interesting, Almeida et al found an NLG 
where quantum resources do no better than classical resources |Almeida2nT?1] : 
and for n > 3, this game defines a facet Bell inequality of £f for probabilities 
p(m|s). Investigating the limitations of quantum correlations therefore seems to 
be just as interesting as finding its advantages. 

We will use this function in the next subsection to say something about MBQC 
|Raussendorf200l] IRaussendorf200"3] . In particular, we look at a restricted class 
of computations in MBQC and map this class into the framework of Bell tests. 
We employ the Bell test as an NLG but in the language of games, there is 
a "promise" on the inputs }Cleve2004] . That is, the inputs are the result of 
some pre-processing on a bit-string |Anders2009) . This pre-processing has a 
well-defined role in MBQC and we use our NLG to show that our restricted class 
of MBQC is not equivalent to a universal Quantum Computer. The key to all of 
these insights is the computational perspective of the space C 

3.4 Non-adaptive Measurement-based Quantum 
Computing 

MBQC as formulated by Raussendorf and Briegel |Raussendorf200T] has been 
one of the great breakthroughs in quantum computing. Whereas the origi- 
nal circuit model of quantum computing requires the ability to perform uni- 
tary operators over the length of the computation |Nielsen2000j . MBQC reduced 
this to state preparation and sequential single-site (single-qubit) measurements 
|Raussendorf200l] . The state that is prepared is a multipartite entangled state, 
e.g. the "cluster state" |Raussendorf2003] . We immediately see that MBQC is 
more in the vein of a Bell test, which (for quantum correlators) consists of the 
preparation of a potentially entangled state and then single-site measurements 
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Figure 3.1: Non-adaptive MBQC consists of pre-processing on data, this data 
is then sent to the parties who make a single round of measure- 
ments. The classical control computer as well as performing pre- 
processing, processes the measurement outcomes. (Copyright: Insti- 
tute of Physics, 2011). 



on each part of this state. In this section, we show that the connection is more 
concrete than just this superficially shared language. 

Briegel and Raussendorf showed that adaptivity is a key component of their 
formulation of MBQC |Raussendorf200H IRaussendorf2003j . in order that all pos- 
sible quantum circuits are implemented deterministically. A natural question is 
what happens when we remove adaptivity? If we do not have adaptivity, then all 
measurements can take place simultaneously. This also simplifies the technolog- 
ical implementation of an MBQC, where a state only needs to be prepared and 
then measured instantly. Adaptivity means that a state needs to be stored for a 
non-negligible amount of time between measurement rounds. 

We now define the class of computations in MBQC without adaptivity or as we 
will call it, nMBQC. We do not place restrictions on the measurements, state 
prepared or number of sites. The element which remains the same in Raussendorf 
and Briegel's formulation of MBQC is the control computer [Raussendorf2003] 
IAnders2009j . The control computer can only implement XOR gates, or addition 
modulo 2, on bits |Anders2009| (see section 1.3 and 1.4). However, the necessity 
for particular measurements and states is not as well-defined; the cluster state is 
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an example of a useful resource |Raussendorf2003] , but there are other examples 
|Hein2005l IVandenNest2006| . We now define nMBQC as an abstract model. See 
Figure 3.1 for an accompanying schematic of nMBQC. 



Definition 7. The model of Non-adaptive Measurement-Based Quantum 
Computing, or nMBQC, involves the preparation of an n-partite quantum state 
{ip) and a classical control computer C. The computer C receives a bit-string x 
of length |x| with uniform probability ^j^^. The control computer performs arbi- 
trary XOR gates on a bit-string x and communicates the choice of measurement 
Sj(x) to each jth site. There is a single round of measurements on all n non- 
communicating sites. The control computer receives the measurement outcomes 
from each site as bits mj and computes the parity of m: rrij- 

The goal of this model is then to deterministically perform some Boolean 
function /(x) efficiently on the original bit-string x for all x. By efficient, we use 
the computational complexity convention that the amount of resources, in this 
case n, is polynomial in the size |x| of the input x. We can ask what the worst- 
case number of resources, or measurements sites to perform this function so that 
the function can be performed for all instances of x. In our definition above we 
add a uniform probability distribution on all inputs. This uniform probability 
distribution becomes relevant if we cannot perform a function deterministically, 
but want to maximize the probability of performing a function. The uniformity 
condition on all instances of x means there is no bias on any particular bit-string 
X, since it may be easier to compute a function /(x) for particular instances of 
X. Some results in the following discussion (such as [131 ) ^° require us to 
consider a distribution at all but we introduce it to cement a connection to Bell 
inequalities later on. 

We have so far not mentioned any constraint on how our resource is con- 
structed. Perhaps our resource can only be produced using exponential quantum 
computations. We place no constraint and just assume that the resource quantum 
state is "presented to us" and we make measurements on it. In fact, the optimal 
resource for nMBQC can be generated efficiently by a quantum computer. We 
will show that the optimal resource for all computations in nMBQC is the GHZ 
state, and this state can be generated efficiently |Hein200"5] . 

The hope is that even in this model of nMBQC we might be able to per- 
form (at least) all efficient classical computations (i.e. in the complexity class P 
|Papadimitrioul994| ) efficiently. In this section, we show that this is not possible 
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and the model is quite limited. Even then, this model can simulate the statistics 
of Clifford circuits |Raussendorf2003] IJozsa2006] , which are not believed even to 
be universal for classical computing |Aaronson2004j even though they produce 
entanglement. Computations in nMBQC are also in a recently studied class of 
limited quantum computations called "Instantaneous Quantum Polytime" (IQP) 
|Shepherd2009| . It is possible that IQP is not capable of simulating a full quan- 
tum computer, but IQP circuits are also not believed to be simulatable efficiently 
with a classical computer |Bremner2011j . 

We present an analogous result to that for IQP, but for nMBQC where there 
are functions that can be performed with greater mean success probability with 
quantum than classical resources. This result is in terms of computational ex- 
pressiveness, rather than computational complexity. It also results from the fact 
that the model of nMBQC can be expressed as an NLG |Hoban2011aj . Non- 
trivial Bell inequalities can be derived from these games, and a violation of these 
Bell inequalities implies a computational advantage with quantum resources in 
nMBQC. 

3.4.1 nMBQC, NLG and non-trivial Bell inequalities 

We now formalise nMBQC and consider the tools required for our analysis. 
Firstly, the "goal" of nMBQC is to perform functions /(x) both deterministi- 
cally and efficiently in |x| for all instances of x. This means that the mean 
success probability p(0"=i rrij = /(x)) of performing a function is 

n ^ n 

Pi® ^3 = m) = ^lY.P^®'^^ = /Wl^) = 1- (3.64) 

j = l X j = l 

If this value is less than unity, a function /(x) cannot be performed determinis- 
tically. We now relate this probability to the correlators p(l|s), i.e. the statistics 
of obtaining ©"=i "ij = 1 given inputs s. In nMBQC, the inputs sj are lin- 
ear Boolean functions in x, i.e. XOR gates performed on elements of x. Also, 
without loss of generality, we consider inputs Sj being of the form ©|,^li ctjXj ® b 
for aj £ {0, 1} and 6 = 0. If 6 = 1, each site can remove this constant from 
their input. We can then relate the bit-string s to x by an |x|-by-n matrix P 
representing the linear transformations on x in mod 2. That is, every string s 
can be expressed as 

s = (Px)g3 (3.65) 
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where (...)© represents matrix multiplication modulo 2. The strings s and x are 
then n- length and |x|-length column vectors respectively. If we use the example 
from the introduction with 3 parties and have the input on the third party being 
•S3 = si ® S2, with the choice of measurement on site 1 and 2 being si and S2 
respectively. If we fix si = xi and S2 = X2 then P = [0 j]- Therefore, for each 
computation in nMBQC we fix the matrix P that designates the computation 
performed by the control computer C. The matrix P does not contain a row 
consisting of all-zeroes so there sj is always dependent on elements of x. 

The figure of merit mj = /(x)) can now be expressed in terms of 

correlators p(l|s) to obtain the following: 

Pi^rn, = /(X)) = ^ EE'^(Px)e ('^1'^''^®' + (-l)^^''^®V(l|s)) . (3.66) 

j = l X s 

We can immediately see that the right-hand-side is of the form of a non-trivial 
Bell inequality for (n,2,2). The probability distribution 7r(s) is 2R'^^Px)0 
but with the sum now over x instead of s; x is however uniquely related to s. 
The function /(x) is then precisely the function in a non-trivial Bell inequality. 
This non-trivial Bell inequality is then 

^EE*(p.).((-i)'""»'f(iN)) < i--EE*(p.).te'-*?M) 

X S ^ ' X s 

^ ^7x5^E«Jx1-^/(x)) (3-67) 

where g{x.) are all possible linear Boolean functions on x. Since linear Boolean 
functions are g{s) = ©"=1 ajSjQb for aj, b £ {0, 1} and Sj = [(Px)^]j is a linear 
Boolean function on x. Therefore g{s) becomes an arbitrary linear Boolean 
function on x being g{x). 



Crucially the right-hand-side of (3.67) is independent of the number of sites 



n and is always strictly less than 1 — ^ Z]x'^/{x) f^-^^ being a non-linear 
Boolean function. This latter fact also means that Pi^^j^inij = /(x)) < 1 for 
classical resources. If /(x) is linear then p(0"=i rtij = /(x)) = 1 for classical 
resources; this is because the only deterministic correlators p(l|s) = 6(^^^ possible 
in LHV theories are for the linear Boolean functions /(x). Raussendorf has 
also shown that classical, or more generally, "noncontextual" resources can only 
perform linear Boolean functions deterministically in MBQC with a single round 
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of measurements |Raussendorf2009] . 

We now focus on using quantum resources in nMBQC. If quantum resources 



are more useful in nMBQC than classical resources then the inequality in (3.67) 
is violated by quantum correlators. Also if i5(©?=i "i-j = /(x)) = 1, then from 



(3.29) 



sup 



2l''l+i 



(eE*7p.,.(-i)'"") 



+ 



X s 

= ^ " ^ E E ^{Fyc)Jn^) 



(3.68) 



We make the substitution of (— = 1 — 26^^^^ to obtain 



1. 



This condition reduces to (-1)/W®i = ei(T.]=i[iP^)<sh9,) ^Yiere Sj = [(Px)e]i. 
In the following result, we indicate that this condition can always be satisfied if 
n is at most equal to 2^^^ — 1. Therefore, all Boolean functions can be performed 
deterministically with quantum resources in nMBQC. The issue of efficiency will 
be discussed further into this section. 

Theorem 13. Every Boolean function f{x) can be performed deterministically 
in nMBQC for at most n = 21^' — 1 parties. 

Proof: As mentioned, every function /(x) can be achieved deterministically if 
(_l)/(x)ffii equal to e^^^J^i^^^''^®'^^^^. This will be satisfied if each expression 
Yl]^=i[(^^)®]j^j for X 7^ is linearly independent from every other expression 
corresponding to each s. To show this we just need to establish that all vectors 
s = (Px)® are linearly independent over M. In other words, if we construct the 
2lx| — 1-by-n matrix S where rows are the vectors for s 7^ 0, then S must have 
rank 2^""^ - 1. 

Every column of S has elements g{x) where ^(x) = o.jXj is a linear 

Boolean function. We showed in Lemma [9] that 2^^^ — 1 vectors which have the 
elements being a different linear Boolean function of this form are linearly inde- 
pendent over R. Therefore if each column of S corresponds to a different linear 
Boolean function g{x) = 0"=i ajXj on x, then the rank of S is 2l''l - 1. □ 
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Therefore, with quantum resources a function /(x) can be computed. However, 
this result has only upper-bounded the resources n required to compute functions 
and this upper bound is inefficient. We now show that this upper bound is tight 
for all possible functions by using the example of the n-partite NAND function. 
We then use the following result to show that adaptivity is a crucial ingredient 
in MBQC. 

Theorem 14. The n-partite NAND function can only be performed determinis- 
tically in nMBQC for n = 2l^l — 1 parties. 

Proof: For ease of calculation, we prove this theorem for the n-partite NAND 
function with a NOT on each element of x, i.e. /(x) = Hjiil^i © 1) ® 1- 
However, the two functions are equivalent in our model as the control computer 
C can perform a NOT operation on each element of x. We prove this theorem 
by assuming that this function can be performed deterministically with 2^^^ — 2 
parties, and then obtain a contradiction. We first simplify the proof, instead of 
considering all possible 2^^^ — 2-by-|x| P matrices, we only need to consider one 
particular matrix Q. This matrix Q is the matrix with all rows being bit-strings 

|x| 

not equal to either or 1 . Any matrix P not equal to Q can be turned into Q 
in the following way (HPM)® where H is a 2^^^ — 2-by-2l''l —2 permutation matrix 
and M is any binary, invertible |x|-by-|x| matrix. This is because P 7^ Q then P 

|x| 

contains a row equal to 1 and does not contain a bit-string y G Z2 • Therefore 
we use M to map y to 1 by right multiplication (My)^ = 1 and (PM)^ contains 
all the same rows as Q but not necessarily in the same ordering. To establish 
the same ordering we left-multiply (PM)0 by the permutation matrix H so that 
(nPM)e = n(PM)e. 

The permutation matrix H just is equivalent to permuting all n parties and 
so will leave the probability of performing a function invariant. If the function 
/(x) = Hjiil^i © 1) © 1 is performed deterministically with quantum resources 
then e^(^i=i[(^'')®]^^-') = 1 for x = and 

for ah X / 0. Since (PMO)© = (PO)© then e*(^?=i[(^'^'')®]^^^) = 1 for x = 0. 



The set of strings {x|x 7^ 0} is equal to the set {(Mx)0|x 7^ 0}, then (3.69) 
is satisfied for both P and (PM)©. Therefore satisfying determinism for one 
matrix such as Q is equivalent to satisfying determinism for all P matrices. 



Ill 



We now show that deterministicany performing /(x) = njii(^i © 1) © 1 is 
impossible for the matrix Q. First we observe that if determinism is satisfied 



then from (3.69) we must satisfy 



^[(Qx)e]i^, =^(2tx + l), 



(3.70) 



for all X 7^ where ix is an integer. We can construct a sum over all bit-strings 
X (including 0) which alternates in sign: 



2I' 



E 



j;(-l)'^W[vr(2tx + l)], 



(3.71) 



where W{x) is the Hamming weight |MacWilliamsl977] of x, i.e. the num- 
ber of non-zero elements of x. We collect terms that have the same Hamming 



weight VK(x) on the right-hand-side of (3.71) and set y = VF(x). Defining 



uy — Y2x-w{x)=y^^ then the right-hand-side of (3.71) is equal to: 

|x| 



where t = Yl^yli{—^)^ty is some integer. 



We now show that the left-hand-side of (3.71) is actually equal to zero, thus 



leading to the contradiction that 7r(2t — 1) = indicating that (3.69) is not true 
for all X 7^ and determinism is not achieved. We can express the jth element 



of the sum in (3.69) as 



|x| 



5:(-i)^w[(Qx)eb = j:(-i)^^''M ® 



(3.73) 



, fc=i 



where Qj^k the element of Q corresponding to the jth row and kth column. 
Since every row of Q has at least one element Qj^k = for a particular value 
of k, then for two bit-strings x' and x" that only differ in their kth. element, 
(©fcli Qj,kx'i,^ = (©fell Qj,kx'l^ ■ However, the two Hamming weights T^(x') 
and Ty(x") corresponding respectively to these bit-strings differ by 1 resulting in 
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(_l)H^(x') Q^. ,4 j + (_i)Ty(x") Q^. j ^ 0. (3.74) 

Since all 2^^^ bit-strings x can be paired into bit-strings that differ by one element, 
then (3.73) must be equal to zero. Therefore, we have reached a contradiction 
and deterministic computation of /(x) = njii(^i © 1) © 1 cannot be achieved 
with less than 2^^^ — 1 parties. □ 



This result says that determinism in nMBQC comes potentially at the price 
of an exponential overhead in resources. Contrast the above result with the fact 
that the n-partite NAND function can be implemented deterministically and 
efficiently by a classical computer. In fact, we do not need the full computing 
power of P, but a smaller complexity class called NC^ which is contained in P 
|Papadimitrioul994| . Since a quantum computer can implement all computations 
in P, we have the following corollary. 

Corollary 2. It is impossible to efficiently achieve universal quantum computa- 
tion deterministically in nMQBC. 

It is interesting that Bell tests, and in particular, non-trivial Bell inequalities 
have something to say about quantum computers. We know that Bell tests 
have a role in quantum cryptography and communication complexity; they now 
have some role to play in quantum computation. This relationship between 
foundations and applications of quantum physics is not unidirectional. We now 
discuss in the following subsection, how these results for nMBQC say something 
about Bell tests and correlations. In particular, they convey generalisations of 
the GHZ paradox |GHZ1989] mentioned earlier and indicate that there exist 
generalisations of the PR box |Popescul994| that may not defined on all inputs 
s. 



3.4.2 Generalized GHZ Paradoxes and PR boxes 

The original GHZ paradox |GHZ1989] was constructed as a way to demonstrate 
the incompat ability of quantum physics with a LHV theory, but without the 



use of a Bell inequality. In the original paradox as discussed in section 1.2 the 
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following outcomes, translated into correlators: 



p(l|000) = 1 
p(l|011) = 1 
p(l|101) = 1 



(3.75) 



in an LHV theory deterministically predict that p(l|110) = 1. These statistics 
belong to the statistics of an extreme point of L. However, measurements on a 



GHZ state lead to a contradiction where the expressions in (3.75) are satisfied 
but p(l|110) = 0. The LHV statistics in (3.75) result in a value of 2 for the Mer- 



min inequality (3.13), but the quantum statistics result in a maximal algebraic 



violation of 3. Therefore our result in Theorem 13 can result in a GHZ paradox 
for 2l''l — 1 parties. We can assign the following statistics in an LHV theory: 



p(l|s; s = (Px) 



1, 



(3.76) 



for all X ^ 1 and P is the 2l''l — l-by-|x| matrix with rows consisting of all 
2l^l — 1 bit-strings y G zL^' not equal to 0. If we put these statistics into the non- 



trivial Bell inequality in (3.67) corresponding to the n-partite NAND function, 
we obtain: 



REE^(Px)e(-l)^^^''^"Vl|s) 



2l''l 



< 



1 

2W 
2l^l - 2 



2l^l - 1 -j5(l|s;s = (PI) 



(3.77) 



then for LHV theories we can only assign the probability p(l|s;s = (Pl)^) = 1 
deterministically. However, since with 2l''l — 1 parties, we can perform the NAND 
function deterministically with quantum mechanics, we can satisfy both the prob- 
abilities in (3.76) and p(l|s;s = (P1)0) = 0, leading to a contradiction. 

We did not need to make this argument utilising a Bell inequality as we could 
have just used the statistics of the LHV correlator producing the linear Boolean 
function /(x) = 1 deterministically. This deterministic correlator is the only 



correlator that satisfies all assignments in (3.76). In this sense then, we have a 
GHZ paradox for all choices of |x|. 

Finally, when we introduce the pre-processing on inputs s = (Px)^ and con- 



struct a non-trivial Bell inequality of the form in (3.67) then we do not consider 



all possible correlators p(l|s) but only those correlators where s is defined by 
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X and P. As a result we only consider probabilities p(m|s) that also satisfy 
this relationship between s and x. We can consider non-signalling probability 
distributions p(m|s) that are of the following form 



p(m|s; s = (Px) 



2^ if ©"=i"ii = /(x), ^^^g^ 
otherwise, 



for any non- linear Boolean function /(x). We are not concerned with inputs s 
that do not satisfy s = (Px)0, therefore, these distributions are not necessarily 
extreme points of MS. The distributions may even be in the interior of MS but 
can be perceived as a generalisation of the PR box |Popescul994| , due to the fact 
that they maximally violate a Bell inequality for all correlators. 

For example, the Mermin inequality is maximally violated by correlators re- 
sulting from a GHZ state, but we can also achieve the same maximal violation 
with vertices oi MS. The correlations p(m|s) that result from the GHZ state do 
not form a vertex of MS. For s ^ {{000}, {Oil}, {101}, {110}}, the correlations 
j»(m|s) resulting from the GHZ state do not resemble those of extreme points in 
MS. 



What Theorem 14 implies, is that even though (3.78) is defined on a subset 
of inputs s, there exist non-signalling probability distributions for n < 21^' — 2 
that cannot be achieved by quantum mechanics. More specifically, if /(x) in 



(3.78) is the n-partite NAND function, since quantum physics cannot achieve 



this distribution for these values of n, they are as "unphysical" as the PR box. 



Theorem 14 also implies that there are generalised PR boxes that can effi- 
ciently perform the n-partite NAND function in our nMBQC model. The fact 
that these unphysical resources can efficiently perform tasks unthinkable with 
physical resources has been analogously investigated in the field of communi- 
cation complexity. An argument put forward first by Van Dam |vanDam2000j 
and then developed by Brassard et al |Brassard2006j . is that if these unphysical, 
bipartite PR boxes exist then tasks in communication complexity are rendered 
"trivial". By trivial, we mean that only one bit of communication is required 
between two parties to achieve all Boolean functions. These ideas were also ex- 
tended to the multipartite scenario |Marcovitch2008j . It could be argued that the 



result of Theorem 14 complements the idea that quantum mechanics cannot sim- 
ulate all non-signalling probability distributions because information processing 
would be rendered "too easy". 
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In this section we have discussed the interplay between the computational 
perspectives on Bell tests and computation itself. In particular, we looked at a 
restricted class of computations in MBQC, itself a promising avenue for quantum 
computing. We have used Bell tests to show that adaptivity is crucial in Briegel 
and Raussendorf 's MBQC scheme |Raussendorf200l] . With adaptivity comes the 
possibility for parties to communicate to each other and the connection between 
computation and Bell tests can break down. In the next chapter, we hint at a 
method to re-establish this connection. 

3.5 Chapter Summary 

When Bell first formulated his inequality he wanted to say something concrete 
about the interpretation of the wavefunction [Bell2004] . He established that if 
quantum mechanics is to be re-imagined as a local hidden variable theory, then 
a great deal of the theory's predictions would have to be "thrown out". Classical 
physics can be conceived as a local hidden theory, so there is an incompati- 
bility between classical physics and quantum physics. This incompatibility is 
"witnessed" by a Bell inequality: a violation indicates incompatibility. It imme- 
diately tells us that quantum systems can do something that classical systems 
cannot. 

It could be argued that it was inevitable that this tool for disambiguation 
between classical and non-classical would be used to show that quantum cor- 
relations can perform some tasks that classical correlations cannot. With the 
development of quantum information theory. Bell tests were approached with a 
new motivation: to find a quantum advantage for some quantum information 
processing tasks. For example, the application of Bell tests to cryptography 
|Acm200"7] and random number generation |Pironio2010j has been successful. 

Quantum computation could produce an advantage over classical computers 
|Shorl997] . The proof that quantum computers are more powerful than classical 
computers would have an immense impact on the study of classical computa- 
tional complexity as it would provide a separation in a conjectured hierarchy of 
computational models |Papadimitrioul994] . Since the Bell test produces a clear 
cut distinction between quantum and classical, it could be considered a useful 
tool for proving this separation in computational models. The difficulty lies in 
communication, a resource not allowed in Bell tests, but not prohibited in most 
models of computation. 
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Immediately one can suggest that we study models of computation that do not 
require or even limit communication. Communication complexity is a model of 
computation that limits communication |vanDam2000j . and non-local games do 
not allow communication between players but to the referee |Cleve2004] . Connec- 
tions have been made to the latter with multi-prover interative proof systems, a 
model of computing based on the exchange of messages between parties in order 
to ascertain whether a potential solution to a problem is correct |Cleve2004] . In- 
teractive proof systems have been shown to be extremely powerful, potentially far 
more powerful than computations in NP depending on the model |Jain2010| . If 
we want to say something about classical and quantum computers, then in these 
"simpler" models we will still want to place restrictions on communication. This 
motivates our study of MBQC circuits where the only communication allowed is 
between a classical computer and measurement sties, sites cannot communicate 
with each other and there is a single-round of measurements. 

In this chapter, we began by discussing the space of LHV correlators in terms of 
the facet Bell inequalities. Finding facet Bell inequalities is hard and in practice 
we could only find them for a limited number of (n, c, d) scenarios. This motivated 
us to find a set of non-trivial Bell inequalities. These non-trivial inequalities were 
motivated by our computational insight into the space of LHV correlators, and 
were shown to be relevant for the study of non-local games (NLG). Finally, 
our restricted class of MBQC computations was shown to be cast as an NLG, 
and again made relevant to non-trivial Bell inequalities. Using the tools from 
the study of Bell inequalities, we showed that this restricted class of MBQC 
computations is not universal for quantum computing. However, in this model, 
due to the very nature of the Bell inequality, we showed that quantum resources 
can do something that classical resources cannot. 

We have shown that there are concrete connections between Bell tests and 
some models of computing. On the other hand, we have also shown that commu- 
nication in the form of adaptivity is vital for MBQC. In the next chapter, we will 
indicate how to simulate communication in computations within the framework 
of a Bell test. Perhaps surprisingly, this communication simulation still allows 
the possibility for disambiguating quantum and classical resources. 
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4 Data Post-selection in Bell Tests 



The Bell test has been around formally for decades. A natural question is 'can we 
go beyond this formulation?' Of course, situations altering the number of parties, 
inputs and outputs have been studied. Despite these generalizations, the core of 
the gedankenexperiment still involves space-like separated parties making their 
measurements and then sending their data to be turned into statistics. However, 
in reality, data does not always emerge perfectly from experiments, and often 
it needs to be discarded. CHSH took this imperfection into account and added 
an extra assumption to the construction of Bell tests beyond Bell's formulation: 
the "fair-sampling assumption" |(:HSH1969I IClauserlQTSl |Berry20l"0l . This as- 
sumption essentially states that the experimental errors in performing a Bell test 
are independent of the choice of measurement at each site. In the history of ex- 
perimental tests of Bell inequalities, this assumption has featured strongly, espe- 
cially in optical tests |Freedmanl972l IShihl988[ I()ul9881 |Rarityl9901 ITittell998[ 
IWeihsl998) . 

Whilst the fair-sampling assumption may be rooted in common sense, we can- 
not assume, in general, that it is true. However, if we relax it then the discarding 
of data can be problematic. In particular, it can lead to the "detection loophole" 
|Pearlel970| |Gargl987| as it is now often referred. A "loophole" emerges when 
some imperfection in the experiment can allow LHV correlations to simulate 
quantum correlations. There a several sources of loopholes in experimental Bell 
tests, some more subtle that others. 

Two central constraints on the construction of Bell tests are measurement 
choice independence and space-like separation. If the latter is not respected 
in an experiment, then parties can communicate and from this communication, 
simulate whichever correlations they wish. Bell has emphasized himself how 
important that choice of measurement be completely random and independent 
of the parties' systems |Bell2004] . Barrett and Gisin have directly related the 
lack of measurement choice independence to simulating communication between 
parties. These central stipulations of the Bell test must be upheld if we want to 
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restrict what is possible with LHV correlations. 

Modern, photonic-based Bell tests allow for space- like separated measurements 
|Tittell998[ IWeihsl998| . The issue of freedom of measurement choice can tend 
towards philosophy, and the concept of "free will". These discussions are well 
beyond the scope of this thesis. It could be argued though that photon Bell 
tests can also address the need for random choice of measurements |Weihsl998j . 
As discussed in section |1.4| of chapter [T| random numbers can be generated by 
quantum processes, potentially in a device independent manner |Colbeck2007t 
IPironio2010j . Experimental groups have exploited this source of randomness to 
produce random measurements jWeihsl998| . 

The issues raised by more systematic failures to implement Bell tests are prob- 
lematic. The detection loophole is a more subtle source of problems. It can be 
seen to result from a form of "post-selection". Here we use the term post-selection 
as a means of accepting measurement data if it satisfies particular criterisj^ In 
the case of imperfect detection where our measurement devices (detectors) may 
or may not receive a measurement outcome (detection event), we can only calcu- 
late correlations for all parties if all parties have made a successful measurement. 
Therefore, we accept or post-select on measurement data if all sites successfully 
detected a measurement outcome. In the first section of this chapter, we will 
formalise these ideas in the (n, 2, 2) scenaric]^ 

This chapter concerns itself more generally with data post-selection in Bell 
tests. In particular, we introduce two forms of post-selection and associate a 
loophole with each form of post-selection. In section |4.1| of this chapter, we dis- 
cuss the form of post-selection in the presence of imperfect detection, whereas in 



section 4.2 we consider post-selection in perfect Bell tests. By the latter, we mean 
that we have perfect detection, space-like separation and freedom of measurement 
choice (the original gedankenexperiment) but introduce a form of post-selection 
on accepting measurement data. Whilst the post-selection in section jlTT] is exper- 



imentally motivated, the post-selection in section 4.2 is very much conceptually 
motivated. Despite their differing motivation there is an overlap in the language 
we use to describe the loopholes. This language is rooted in our computational 
insight into LHV correlators. 



^Post-selection in quantum information can often mean the acceptance of a quantum state after 
measurement, if a particular measurement outcome is achieved. Otherwise the quantum 
state is discarded. 

■^These ideas can be extended to different scenarios, but for pedagogical clarity and the ease 
of producing new results we make this restriction. 
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Interestingly, whilst post-selection on successful detection can lead to the de- 



tection loophole (as we shall show), the post-selection in section 4.2 can be de- 
scribed as "loophole-free". As well as the latter constraining LHV correlators in 
the presence of post-selection, it can also enlarge the space of quantum correla- 
tors. We also indicate that connections can be made between MBQC and our new 
form of data post-selection. Finally in section [4^ we give some indications that 



generalising the results of section 4.2 to different (n, c, d) scenarios may become 



problematic, and no longer loophole-free. 

The original work in this chapter was developed in collaboration with Dan 



Browne. Section 4.1 (except subsection 4.1.3) is a rederivation of the work of 
Garg and Mermin in f Gargl987| , but now in our computational description of 



Bell tests. Subsection 4.1.3 consists of a new result generalising the work of Garg 



and Mermin to n parties. In section 4.3 all of the work was completed also in 



collaboration with Joel Wallman. Results in section 4.2 have been published 



as |Hoban2011bj and some of the results in section 4.3 have been published in 
|Hoban201l"^ . 



4.0.1 Notation 

In this chapter, we will carry over the notation convention for modular arith- 
metic introduced in the last chapter. The first two sections of this chapter solely 
consider the (n, 2, 2) scenario and so we use © and to denote addition and 



summation modulo 2 for only the (n,2,2) case. In section 4.3 we consider the 
{n,c,d) cases, and we enclose modulo x arithmetic in brackets, i.e. [■■■]^- For 
further clarification see section 3.0.1 of chapter [3j 



4.1 Post-selection and the Detection Loophole 

We know that the space of quantum correlators is larger than C by Bell's theorem. 
This is a mathematical statement and testing it in the laboratory has been a ma- 
jor endeavour and challenge in the past few decades |GHSH1969[ IFreedmanl972[ 
IShihl988l IUul988L |Karityl990t ITittell998L IWeihsl998l IRowe2001] . However, of 
these experiments, the majority have suffered from the detection loophole. Ex- 
periments such as |Rowe200T] that manage to overcome the detection loophole 
suffer from not having space- like separated measurements |Rowe200T] . There are 
currently no loophole-free Bell tests but there are promising routes for overcom- 
ing the detection loophole |Matsukevich2008l IVertesi20101 |Sangouard2011 . 
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The issue of imperfect detection, culminating in the detection loophole is a 
subtle issue |Pearlel970l |Gargl987| . In a full treatment of a Bell test, a non- 
detection of an event is in itself an event. That is, if a measurement is the 
result of a detection and there are d possible outcomes, a non-detection must be 
another outcome. We cannot rule out the possibility that an LHV theory can 
produce all {d + 1) outcomes. The fair-sampling assumption aims to exclude this 
possibility by saying that the non-detection event is independent of our choice 
of measurement |CHSH1969l IClauserl978| Berry20T0] . This assumption cannot 
itself be tested. For example, we construct an explicit LHV model that violates 
the fair-sampling assumption but the statistics of detection are random at each 
site. We cannot extract the dependence on s from the statistics alone. We do 
not therefore impose the fair-sampling assumption in our discussion. 

Having imperfect detectors does not necessarily mean that LHV correlators 
can completely simulate quantum correlators. Recall that this simulation is how 
we describe a loophole, but we shall make this notion more rigorous in subse- 
quent discussion. Work by Pearle |Pearlel9"70] which was then developed by 
Garg and Mermin Gargl98"7l showed that if the detector efficiency (the ratio of 
successful detection to all incoming events) at each site is above some threshold, 
then a loophole can be ruled out. This detection efficiency threshold has been 
subsequently lowered by further research |Eberhardl993t IVertesi2010] . 

A final, somewhat more applied, motivation for considering the detection loop- 
hole comes from quantum key distribution |Ekertl99T] . We discussed device- 
independent quantum key distribution |Acm200"7t IPironio2009] in section 1.4 of 
the first chapter. Recall that the security of device-independent quantum key 
distribution can be ensured by the violation of a Bell inequality. The intuition 
is as follows: an adversary trying to learn the generated secret key (thus able to 
decode any secret message) can learn it if the key is described by an LHV. The 
secret information is contained in some "local" information at each site which can 
be "extracted" by said adversary. If the secret information is generated by some 
correlations incompatible with an LHV theory, then an adversary cannot localise 
it and obtain it. The detection loophole allows an adversary to learn a secret key 
that can be generated by LHV resources via the loophole [Acm200"7] . 

We structure this section so that we introduce and describe the detection loop- 
hole. Our novel insight into this loophole is to use the language of computational 
expressiveness to describe what LHV correlators can do in the presence of im- 
perfect detection. We show that the post-selection of accepting measurement 



121 



data based on successful detection induces a relationship between each party's 
shared hidden variables and inputs sj. We use this discussion to derive the GM 
threshold detector efficiency, but also to generalise their result to n parties. We 
show that this threshold can be lowered by going from 2 to n parties. A previ- 
ous reduction in the threshold detector efficiency for (2, 2, 2) have resulted from 
considering the full probability distribution and not correlators |Eberhardl993j . 

In this section and the next, we will restrict ourselves to the study of Bell 
tests in the (n, 2, 2) scenario. Therefore we will use Corollary [l] of Theorem [2] 
where C is the convex hull of the linear Boolean functions. If a correlator cannot 
be written as a convex combination of linear Boolean functions for all possible 
decompositions it must lie outside of C. 

4.1.1 The Detection loophole 

The action of discarding data means that the person carrying out a Bell test is 
playing an active rol^ Because of this active role, throughout this chapter, we 
will refer to an "experimenter" who does something non-trivial with the experi- 
mental data. We will describe the role of the experimenter in different contexts 
in more detail throughout this chapter. That is, what the experimenter can and 
cannot do will be prescribed. 

How do we incorporate the issue of a non-detection event into an (n, 2, 2) Bell 
test? Since the number of outputs of a successful measurement is binary, then 
the total number of outcomes is ternary, i.e. mj £ Z3. What is an appropriate 
joint outcome, the sum modulo 2 of all outcomes, or the sum modulo 3? If we 
take the sum modulo 2 then a non-detection will necessarily get mapped to an 
event with a successful detection. Can we still talk in terms of Boolean functions 
if the number of outcomes at each site is ternary? Is a loophole is caused by de 
facto moving out of the scope of Boolean functions? 

We can resolve this discussion by redescribing the scenario only in terms of bit- 
strings. Now instead of each jth site outputting a single digit nij, they output 
two bits {tj,mj} G Z2 X Z2. Here tj is a bit that indicates whether an event is 
successfully detected (represented by 1) or not detected (represented by 0). If 
tj = 1 for all j, then the experimenter takes the sum modulo 2 of all outcomes 
rrij, if tj = for at least one site j, we throw away all data. The elements tj 

^With perfect detection, the experimenter only calculated the sum modulo d of outcomes. This 
can be seen as an active role, however, we take active to mean that they can do something 
non-trivial with the data. 
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make up an n- length bit-string t and we accept m if t = 1, the string of all- 
ones. This discarding of data is a form of post-selection; we call this method of 
post-selection when t = 1 "detection post-selection". 

Definition 8. When the experimenter accepts, or post-selects on data m and s 
when t = 1, this is detection post-selection. This data after post-selection is 
then used to calculate ©"=i mj. 

This action of post-selection as we shall show can be a way of introducing 
loopholes. Before we define a loophole we need to introduce the mathematical 
construction we need to define them. 

The convex polytope V is the space of correlators p(l|s) that are perfectly 
detected, i.e. t = 1 for all runs of an experiment. For imperfect detection, we 
need a new, more general space of correlators that are calculated after post- 
selecting on m and s when t = 1 . We call this more general space V and if every 
run of an experiment produces t = 1, then V = V. However, more generally, 
correlators are now defined in the following way 

n 

|5(l|s)=p(0m, = l|s,t = l). (4.1) 



V is now the space of correlators of the form (4.1). However, the space V for 
the (n, 2, 2) setting can be defined in an analogous way to V. That is, V is the 
convex hull of all correlators p(l|s) = 5^^^^ for any Boolean function / : — )• 
We have put no restriction on the probability of detection p(t = 1), only that 
t = 1. 

In the case for perfect detection, the space of LHV correlators is C as defined 
by Corollary[l| We define C as the space of correlators p(l|s) resulting from LHV 
correlators, computed after detection post-selection. Is the space C always the 
convex hull of linear Boolean functions on s? For perfect detectors where t = 1 
is always satisfied, then C = C Another way of asking this is to write the CHSH 
inequalities in terms correlators p(l|s), 

p(l|00) +pil\00) +p(l|00) -p(l|00) < 2. (4.2) 

If this inequality can be violated by correlators in C then the space C is no longer 
the convex hull of linear Boolean functions. We then associate this violation by 
LHV correlators (in the presence of imperfect detection) with a loophole in a Bell 
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test. We now define this loophole. 

Definition 9. A loophole is introduced by an experimenter into a Bell test if 
after detection post-selection, the space L is larger than the convex hull of linear 
Boolean functions. 

The intuition behind this being a loophole is that if we have a quantum corre- 
lator q (obtained with perfect detection) being outside of £, then it will violate 
a facet Bell inequality. However, if the detectors which obtained this quantum 
correlator become imperfect, then after detection post-selection, the resulting 
quantum correlator in V will agairQ be equal to q. The loophole means that q 
could now be in the space C. It is possible that q could be outside of £, but the 
facet Bell inequalities for £ are possibly no longer relevant for informing us either 
way. We now show that loopholes are achievable with detection post-selection. 
In the following result we show that it is possible that £ can no longer be confined 
to the convex hull of linear Boolean functions. 

Proposition 15. For all LHV theories, £ is larger than the convex hull of linear 
Boolean functions on s. 

Proof: We construct the following specific model with n sites. The (n — l) sites 
for j G {2,3, ...,n} have perfect detectors whereas the first site has an imperfect 
detector. The first detector outputs the detection bit as a function of an LHV 
A and its input, ti = si ® 6(A) © 1 so that 6(A) G {0, 1}, whereas t2 = 1. When 
we post-select so that ti = 1 then si = 6(A). The variable 6(A) is shared by all 
parties, and the second party's measurement outcome m2 = 6(A)s2. If for all 
(n — 1) sites where j ^ 2, the parties' measurement outcomes upon successful 
detection are rrij = 0, then when t = 1, ©"=i rrij = siS2- The resulting correla- 
tor is then i>(l|s) = (5^^^*^ = siS2, which is a vertex outside of the convex hull of 
linear Boolean functions. □ 

This demonstrates how post-selection can be problematic in Bell tests. A 
drawback of the proof of the above result is the asymmetry in the detectors 
between the first detector and the rest. If we were to switch the detectors in the 
experiment and still got the same imperfect detection at site 1 then the rate of 
detection must be independent of the detector. The measure of detection is the 

''We assume that the detection device is independent of the quantum state or choice of mea- 
surement made. 
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detection efficiency r] which is the quotient of number of successful detections to 
the number of events incoming to the detector. We can obtain the efficiency of a 
detector if two sites each make measurements, and then condition the statistics 
of the detector upon the other detecting an event so that 

pjh = l\t2 = 1) ^ P(t = {l,l}) .4 . 

^ P(,h=0\t2 = l)+p{tl = l\t2 = l) p(t = {0,l})+p(t = {l,l})- 

We assume that the detector efficiency rj is the same for ah n sites. Situations 
with non-uniform r] amongst parties have been investigated (e.g. |Vertesi2010| ) 
but is beyond the scope of our discussion here. 

In the early literature discussing the detection loophole (e.g. [PearlelOTOt 



Gargl98"7] ), the probabilities in (4.3) are calculated from the number A^t of events 
where t occured. It is assumed that the number of events where t = is unob- 
servable as they are non-events. Probabilities then become normalised relative 
to this inability to detect when t = and 

Pit) = ^J^. (4.4) 

Then for the above discussion about rj for 2 parties, A't being the number of events 
where t G {0, 1}^ occurs, the total number of events is = A''! + A''|o -|- A''|x^o}- 



The probabilities in (4.3) then are obtained in the limit where N ^ 00 giving 
the efficiency 

If we want rj to be the same for all sites then -/V{o,i} = -^{1,0}- We also now 
impose that the statistics p{tj = 1) should be independent of Sj. This is not as 
strong as the fair-sampling assumption and we can experimentally test whether 
single-site detection statistics are independent of Sj |Gargl987l . This reinforces 
the intuition that the properties of a detector such as r/ should be independent 
of whatever measurement we make. 

In line with previous research such as in |Gargl987] , we now weigh the corre- 
lation statistics with the statistics of detection. Therefore correlators now take 
the form 

p(l|s) =p(t = l)p(l|s). (4.6) 

These correlators p(l|s) are not necessarily normalised so j5(0|s) 7^ 1 — p(l|s) in 
general. On the other hand, p{t = l)p(0|s) = p{t = 1)(1 — p(l|s)). Expectation 
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values of outcomes in the space V can be defined in exact analogy with the 
expectation values E(s) over correlators in V, giving 

]E(s) = l-2p(l|s). (4.7) 

Expectation values ]E(s) for the correlators p(0|s) and p(l|s) can then be related 
to IE(s) to obtain 

E(s) = ^(0|s) - p(l|s) = p{t = 1)(1 - 2p(l|s)) = p{t = l)]E(s). (4.8) 

This relationship between expectation values will be utilised in the following 
section. In fact, because the correlators p(l|s) are not normalised, it will be 
more useful to work in terms of the expectation values E(s). This means we only 
need to consider one number instead of both p(0|s) and p(l|s). 

In the following two subsections we will work in the new space £e of the 
expectation values ]E(s) for LHV theories. In line with previous discussion, this 
space is a 2" dimensional real space of vectors having the elements ]E(s). These 
elements can now be negative but their magnitudes are bounded by unity. The 
space £e is a sub-space of which is now the space of all possible vectors of 
expectation values ]E(s). 

4.1.2 Rederivation of the GM detection efficiency 

We now address the (2, 2, 2) scenario and use it to give an upper bound on the 
detection efficiency r] required in order to demonstrate a violation of the CHSH in- 
equality in the presence of imperfect detectors. This upper bound was derived by 
GM |Gargl987] and has since been improved upon by Eberhard |Eberhardl993] 
in the Clauser-Horne inequality setting |CH1969] . As an aside, it has been sug- 
gested that if we consider different Bell test settings, we can lower the detection 
efficiency required to violate any Bell inequality |Vertesi2010) . 

We will use our computational interpretation of correlators to rederive the 
GM upper bound on the threshold detection efficiency ry. In order to do this, we 
ffist describe the space of expectation values ]E(s). The following result now 
captures this space in terms of a vertex description. 

Proposition 16. The space jCe is the convex hull of all expectation values 
]E(s) = (— l)^^*-' for g{s) being a linear Boolean function on s. 

Proof: First, just like measurement outcomes rrij resulting from LHV theories. 
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the detection values tj can be, in general, written as tj = Xj{X)sj (Byj{X) for bits 
Xj{X) G {0, 1} depending on the local hidden variable A. Therefore, if yj{X) = 1 
and Xj{X) = 0, then p{t = 1) = 1 for all j, X, otherwise if yj{X) = Xj{X) = 0, then 
p{t = 1) = again for all j, X. So then for all LHV maps where Xj{X) = for all 
j, X, the probability of detection for a single-site is p{tj = 1) = p{yj{X) = 1). 

Since sj is randomly generated, for Xj{X) = 1 for all j, X, then p(t = 1) 7^ 1 for 
all yj{X). Finally, the map tj = Sj or tj = Sj®l is forbidden as this means there 
is a direct dependence in the statistics of detection with the choice of input. We 
then instead have maps tj = Sj © yj{X) where yj{X) is shared by both parties 
and generated randomly so that p{tj\sj) = p{tj\s'j) for Sj 7^ Sj. As a result of tj 
being random, p{t = 1) is at most equal to 

If one party employs the strategy of tj = Sj(Byj{X) and the other site produces 
the deterministic map tji = 1 then p{t = 1) = ^. However, the detection 
efficiency r] is not the same for both sides. We can maintain the same probability 
p(t = 1) = I while making the detection efficiency the same for both sides if 
both parties share a random bit z{X) G {0, 1}. When z{X) = 0, = si ® ?/i(A) 
and t2 = 1, and when z{X) = 1, ii = 1 and t2 = si® yi(A). As z{X) is randomly 
generated then p{t = 1) = |(^ + |) = ^. If z{X) were not random then we bias 
one of the strategies and -/V^o,!} ^{1,0} for AT — )■ 00, which is forbidden. 

In this strategy where parties share ^(A), one of the parties learns the other 
party's input Sj as it is equal to a variable yj{X) © 1 when tj = 1. If one party 
learns the other party's variable then they can compute the non-linear Boolean 
functions /(s) = (si©a)(s2®6)©c for a, 6, c G {0, 1} deterministically. Therefore, 
the parties can achieve the post-selected expectation value: 

E(s) = (4.9) 

with /(s) being the above non-linear Boolean function. This gives a value of 

E(s) = p{t = l)E(s) = (4.10) 

We take the convex combination of LHV strategies producing all allowed deter- 
ministic maps tj = Xj{X)sj © yj{X) and then the possible deterministic values of 
]E(s) for each strategy. This then produces the expectation values: 

m = j:P9is){-iy^'^ + 1: ^(-i)^(^). (4.11) 

9(s) /(s) 
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with /(s) and g{s) being all of the non-linear and linear Boolean functions re- 
spectively. We have taken the convex combination with positive coefficients Pg(s) ; 
Pf{s) such that J2g{s)Pg{s) + Ylf{s)Pf(s) — 1- Thus £e is at least as large as the 
convex hull of ]E(s) = (— 1)^^®). If the expectation values in (4.10) are outside of 
this space then they will violate one of the CHSH inequalities 



5^(-1)/(^)e(s) 



< 2, 



(4.12) 



where /(s) can one of the non-linear Boolean functions /(s) = {si(Ba){s2(Bb)(Bc 
for a, b, c £ {0, 1}. If we use the strategy of allowing the maps tj = Sj © yj(A), 

< 4. This lack of violation for 



then p{t 



\ even though Es(-l)^^^-'lE(s) 



the CHSH inequalities therefore concludes the proof. □ 



This result will give an upper bound on the efficiency r\ required of detectors in 
order to establish that certain values of ]E(s) are not in £e. The result indicates 
the structure of £e is the same as £, and the CHSH inequalities are exactly the 
same, i.e. 



p(t 



E(00) + E(01) E(10) - E(ll) 
1) (E(00) -FE(Ol) -ME(IO) -E(ll) ) < 2. 



(4.13) 



If we assume that the values -E(s) are obtained from measurements on quantum 
systems, then the maximum quantum value of £'(00) -|- S(01) -|- £'(10) — £(11) 
is Tsirelson's bound, 2\f2. In order to demonstrate a violation of the inequality 



(4.13), we must then satisfy p(t 



We now relate the value of p(t 
following expression: 



1) to the detection efficiency r\ with the 



p(t = 1) = lim 



N- 



>c» iVi + Af|o,i} + ^^{1,0} 



lim = 

Af^oo iVi 2Af{o,i} 2-7? 



(4.14) 



since N. 



{0,1} 



N. 



{1^0}- A value of p(t = 1) > -i= thus gives ry > -^j^ ~ 0.8284. 
This is exactly the detection efficiency derived by CM |Gargl98"7] . 

In GM's result of ~ 0.8284, they use a Bell inequality derived for spin-0 
particles |Merminl982j . In this original work, it is perhaps not clear, in general, 
how a loophole is avoided or created. We have explicitly shown the mechanism of 
how loopholes are formed and this is due to the emergence of non-linear Boolean 
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functions in the event of post-selection. The beauty of our approach, as we shall 
show in the next subsection is that it can be generalised to n parties; something 
not immediately attainable in the GM approaclj^ In the following subsection, 
we describe this generalisation to (n, 2, 2) scenarios. 

4.1.3 Generalisation of the GM bound to Many Parties 

In the previous subsection, we showed that as long as detection efficiency is above 
some threshold then quantum physics can violate a Bell inequality. The threshold 
we derived was already attained by GM. Our rederivation makes the mechanism 
of loopholes very clear and also establishes the framework for generalising to 
more than two parties. In this subsection we now present a new result. 

We have shown that the GM threshold for detection efficiency is reached when 
the quantum systems achieve Tsirelson's bound. If the quantum systems do 
not achieve this bound then the detection efficiency needed to rule out an LHV 
description needs to be higher. That is if E{00) + £^(01) + E{W) - E{ll) = 2 + e 
where < e < 2(\/2 — 1) results from quantum correlators then the detection 
efficiency must satisfy rj > A natural extension of this result is to find Bell 
inequalities in other Bell tests where the detection efficiency required is lower. 
Then a bigger range of quantum values of a Bell expression can be tolerated and 
rule out an LHV description. 

This has also been investigated in the full probability distribution Bell setting 
(e.g. the Clauser-Horne Bell setting [GH1969j ) . For example, Eberhard showed 
that for the CH inequality the minimum detection efficiency is given by r/ > | 
|Eberhardl993] . This value has been subsequently lowered if one increases the 
number of measurement settings that one can choose from |Vertesi2010j . How- 
ever, we are focussing on the n-party setting with 2 inputs and 2 outputs; we 
will explore a generalisation of the derivation of the GM bound to the (n, 2, 2) 
setting and show that the threshold for ry decreases from t] « 0.8284. 

The intuition then is to find inequalities where the maximal quantum vio- 
lation is larger than for the (2, 2, 2) case. For the (n, 2, 2) case, WW have 
shown |Werner2001j that the quantum violation of the Mermin-Klyshko inequal- 
ities |Merminl9901 IBelinskiil9931 IGisinl998] (and inequalities in its orbit) is the 
largest violation for any (n, 2, 2) inequality. There is only one vertex of V that 
maximally violates this inequality (for n being even), as shown by Marcovitch 



^One would need to find the facet Bell inequalities for 3 or more spin-0 particles. 
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and Reznik |Marcovitch2008] . We will restrict ourselves to the cases (n, 2, 2) for 
n being even. We shall describe the odd n case as an extension of the even case. 

The vertex of V that maximally violates the Mermin inequality for n being even 
is p{l\s) = 5)^^^ where /(s) = 0^^^^ Si(0Lj+i ^k) |Marcovitch2008] . When we 
refer to /(s) in this subsection we mean this function in particular. If we were 
to allow communication then this function could be performed deterministically. 
One method would be if each jth party received the inputs Sk for all kih. parties 
where {j + 1) < k < n and 1 < J < (n — 1). Each party did not even need to 
learn every other party's input. This protocol also works if we cyclically permute 
the parties as the function /(s) is invariant under all permutations of parties. 
We now show that this communication protocol can be "simulated" if we perform 
detection post-selection. 

We now describe how we can achieve the vertex p(l|s) = '^j(s) of V correspond- 
ing to the function /(s) = 0"=! Sj{@i^^-_^^ Sk) with LHV correlators. We do this 
by simulating the above communication protocol using detection post-selection. 
We call this post-selection protocol the "Mermin-Klyshko post-selection" (MKP) 
protocol: each A;th party for 2 < A; < n produces the map = Sk ® VkW © 1 
where all n parties share the {n — 1) bit- values Uki^)- Party 1 produces the map 
ti = 1. As before, the variables yki^) are randomly generated. Therefore after 
detection post-selection, all parties have mapped the inputs Sk for j ^ 1 onto 
the shared variables yk{^)- Then each jth party for 1 < j < (n — 1) outputs 
the value rrij = Sj(0^^j^]^ yA:(A)) = Sj(0^^j_,_^ Sfc) and the nth party outputs 
nin = 0. As a result, we obtain the correlator ]5(l|s) = ^j(s)- It is worth noting 
that we need all (n — 1) maps tfc = Sfe © VkW © 1 so that the first party can 
obtain all other inputs. 

As in the previous subsection, in order to consider the detection efficiency we 
need to consider the space £e- We need to consider the probabilities p{t = 1) 
for the LHV maps tj. The MKP protocol produces p(t = 1) = ^(}_-^) ■ However, 
p{ti = 1) = 1 in this protocol. To counter this the n parties share the vari- 
able z{\) £ {1,2, ...,n} which corresponds to each cyclic permutation of the 
n parties. This variable is randomly generated and then the n parties pro- 
duce the MKP protocol but for a particular cyclic permutation. As a result, 

p(t = 1) = ^ 2("-i) ^ ^ 2("-i) • 

Therefore, LHV theories can produce a convex combination of expectation 
values i(s) = (-1)»(^) and l(s) = ^^{-iy'^^'> where /(s). We can substitute 
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these expectation values in the Mermin-Klyshko inequaUty for even n 

1 ^(_l)/(s)t(s)<2, (4.15) 

s 

For the expectation value ]E(s) = ^(n-i) (~1)'^^^\ the Bell expression takes the 
value 2 "2. This inequality is therefore not violated. For odd n, the Mermin- 
Klyshko inequality can be rewritten as |Marcovitch2008] 

;;^EC(n-i,,(-l/^^^%)<2. (4.16) 

We can use the same argument for even n to show that this inequality is not 
violated for any vector of expectation values in C^. First, one can use the MKP 
protocol, as for even n, to give ]E(s) = (— l)'^^^-'. This gives a value of 

3 — n 

2^2^ and so does not lead to a violation. On the other hand, due to the delta 
function , the function /(s) is now independent of s„ and becomes 

+1 Sfe ffi !)• This function can be achieved by (n - 1) 

5 — n 

parties carrying out the MKP protocol, thus producing a value of 2~2~ for the 
Bell expression. In summary then, the Mermin-Klyshko inequality is not violated 
for all expectation values in £e- 

It now remains to express p{t = 1) in terms of detector efficiency ij. Again we 
assume that r/ is the same for all sites and so can be calculated from the number 
counts A't (for t / 0). Therefore taking the limit of = ^t^^o -^t — ^ co, the 
efficiency is 

(4.17) 



^{i,t'} 



^{i,t'} + ^{o,t'} 

where t' 7^ is any of the bit-strings for all of the 2-party sub-sets of all 3 parties. 
The notation {0,t'} ({l,t'}) then says that the other bit not in the sub-set t' is 
(1). We can obtain values of iV{o,t'} terms of rj and -/Vji^t'} substitute 
them into an expression for p(t = 1) (using recursion) to obtain 

p(t = 1) = ^. (4.18) 

^1-1-77)" ^ ^ 



If we substitute the maximal quantum violation of the Mermin-Klyshko inequal- 

n + 1 — 

ity 2^2^ for the expectation values IE(s) then we have the following expres- 
sions p{t = 1) = 2^~. Therefore, for n = 3, detection efficiency must satisfy 



rj > i(\/21 — 3) ~ 0.7913 in order to demonstrate a loophole-free violation of 
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a Bell inequality. Whilst this is a decrease from the GM bound, this value of 
r] does not decrease dramatically; for example for n = 25, r] > 0.7170 but for 
n = 75, ?7 > 0.7104. The bound of r/ > | found by Eberhard (and subsequently 
improved) is more effective for a loophole- free Bell test |Eberhardl993| . 

While these generalisations of the GM bound on r] may not be impressive 
compared to the current literature, our discussion has been motivated by a qual- 
itative description of loopholes. We have also connected the detection loophole to 
communication protocols (cf. |Barrett2011j ). Detection post-selection can simu- 
late communication between parties by correlating input data to shared hidden 
variables. We used this simulation of a communication protocol to derive these 
generalisations of the GM bound. We have also used our computational de- 
scription of all possible LHV maps to make this loophole-producing mechanism 
clear. 

4.1.4 Summary of Loopholes 

We have discussed how experimental imperfections in Bell tests can lead to loop- 
holes. We have briefly covered how loss of measurement freedom and no space- like 
separation can lead to loopholes. In more detail, we have discussed how the sub- 
tleties of the detection loophole can be made clearer with the language of Boolean 
functions. Our language in terms of computational expressiveness allowed us to 
redrive the GM bound and generalised it to n parties. 

Beyond the loopholes we have discussed already, we will now briefly mention 
another: the memory loophole |Barrett2002j . The memory loophole emerges if 
parties retain their choice of input and subsequent output in a "memory" that can 
be communicated between parties in-between tests. From this memory, parties 
can make "educated guesses" about which measurement outputs to give for a 
particular input. This problem occurs from a finite number N of Bell tests from 
which we produce correlation statistics. However, the loophole does not become 
an issue as N ^ oo |Barrett2002j , heuristically, the region of the LHV polytope 
outside of the linear Boolean functions disappears exponentially in A^. Since we 
have assumed that all statistics from experiments are obtained in this limit, the 
memory loophole is not a conceptual, problematic issue. 

In the next section, we look again at post-selection but not from an experimen- 
tal point-of-view. We will assume that Bell tests are perfectly implemented in 
the laboratory. The post-selection introduced establishes a relationship between 
measurement data in a non-trivial fashion. We have shown that with detection 
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post-selection, relationships are induced between hidden variables and measure- 
ment settings, thus leading to loopholes. In this new setting we will define a 
loophole in analogy to the definition in this section. Given this definition, we 
show that this new form of post-selection is free of loopholes. This new method 
is a way of conceptually modifying Bell tests but not modifying the implications 
of LHV theories. 

4.2 Loophole-free Post-selection and Quantum 
Correlators 

In the previous section, post-selection was a necessity in order to calculate corre- 
lators. For non-detection events, measurement outcomes are not defined so the 
sum modulo 2 of outcomes could not be calculated. We now explore the use of 
post-selection utilised by the experimenter out of choice rather than necessity. 
We assume that the Bell test has perfect detectors and the experimenter does not 
need to use detection post-selection. Therefore, data is perfectly obtained by the 
experimenter but they still choose to discard some of this data. We will construct 
a new model to reflect this choice and discuss the possibility of loopholes in this 
model. 

For all of the discussion so far in this thesis, the variable s for each correlator 
p{k\s) has two functions: 1) it labels the inputs to all sites corresponding to 
the choice of measurement settings; 2) s acts as a conditioning variable for the 
probability measure on all maps — )• In this section we will distinguish 
between these two roles by using post-selection on measurement data. This is 
done by relating measurement data to data that is independent of measurement 
settings or outcomes. We motivate this discussion by returning to the Mermin 
inequality |Merminl990j : 

p(l|000) +p(l|011) +ij(l|101) -p(l|110) < 2 (4.19) 

and recall that as in the GHZ paradox, we are interested in correlators when 
•S3 = si ® S2- In the language of computer science, this is called a promise on the 
inputs that they satisfy a particular relation |Cleve2004] . This inequality is also 
superficially similar to the CHSH inequality but now with a third party whose 
inputs are related to the other two sites. 

To make the connection to the CHSH inequality clearer, we notice that the 



133 



linear Boolean functions that LHV theories can achieve if S3 = si © S2 are written 
as mi © 7712 ® 7723 = asi © /3s2 © 7S3 © (5 = a' Si © /3's2 © 5 for a, (3, a', /?', 7, 
5 G {0, 1}. These functions arc exactly the linear Boolean functions for the 
(2, 2, 2) CHSH setting. Therefore, the linear Boolean functions that satisfy the 
CHSH inequality also satisfy the Mermin inequality. The Mermin inequality can 
be seen as a manifestation of the CHSH inequality. 

If we reconsider experimental implementations of Bell tests, then how do par- 
ties obtain the input S3 = si © S2 if they are space-like separated from the other 
two parties? A possible solution is through data post-selection; the third party 
makes a completely random choice of S3. After receiving all data m and s from all 
parties the experimenter only accepts data from all parties and calculate p(l|s) 
if S3 = si © S2; otherwise data is discarded. Since si and S2 are also randomly 
generated, the rate at which the experimenter discards the data will tend to g 
for N ^ CO runs of the experiment. 

We will proceed to generalise this method of post-selection utilised in the GHZ 
paradox. Central to this approach will be the linear Boolean functions. In 
the example of the GHZ paradox, the experimenter post-selects on one input 
being a linear Boolean function. This keeps the computational power of the 
LHV correlators confined to these linear functions. We showed in the previous 
section that loopholes can lead to LHV correlators performing non-linear Boolean 
functions. In analogy with the detection loophole, the post-selection in the GHZ 
paradox can be seen to avoid a loophole. This is the central insight in this section 
and we will now develop these ideas rigorously. 

4.2.1 Post-selection, Linearity and Loopholes 

We now introduce some more general structure beyond the GHZ paradox. The 
experimenter now has some bit-string x of length |x| < n. Referring back to 
the two roles of s described above, x now plays the role of conditioning vari- 
able (role 2). That is, instead of the stochastic maps J'(lls) being conditioned 
upon s, they are now conditioned upon x, i.e. the experimenter calculates 
p(l|x) = m-j = l|x). The experimenter then relates their data x to 

the experimental data m and s. 

If we return to the GHZ paradox, we have three parties but the bit-string 
x G {0, 1}^. The experiment now accepts, or post-selects on data m and s when 
si = xi, S2 = X2 and S3 = xi © X2 is satisfied. Then the experimenter calculates 
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the correlator 



p(l|x) = p{mi © 7712 © ms = l|si = Xi, S2 = X2, S3 = Xi® X2). (4.20) 

A relationship between data s and x is established by the experimenter's post- 
selection. We now generalise this approach of relating s to x with the following 
form of post-selection. 

Definition 10. If an experimenter accepts, or post-selects on data m and s for 
every input sj satisfying sj = gj{x) where gj{x) is some Boolean function on x, 
this is input post-selection. The experimenter fixes this relationship between 
X and s for all runs of the experiment. After post-selection, the experimenter 
calculates p(l|x) for their value of x. 

After the post-selection there are now 2l^l correlators p(l|x) for all values of x. 
Just as with the correlators p(l|s), the correlators p(l|x) are elements of vectors 
fcx S M^'''' . The deterministic correlators are ^'(l|x) = (5^^^^ for all Boolean 
functions /(x). Therefore, vectors are contained in a convex polytope Vx 
with these extreme points being these deterministic correlators. There will also 
be the space of LHV correlators £x in analogy to C. If n = |x| and the functions 
5(j(x) in input post-selection are sj = xj, then we recover the original (n, 2,2) 
Bell test. For this example, £x is the convex hull of linear Boolean functions 
on X. In analogy with the detection loophole defined in the previous section, we 
now define a loophole for input post-selection. 

Definition 11. A loophole is introduced by an experimenter into a Bell test if 
after input post-selection, the space £x is larger than the convex hull of linear 
Boolean functions on x. 

In the next subsection we will show how loopholes are avoided if the experi- 
menter utilises input post-selection. We will then develop input post-selection in 



subsection 4.2.3, now to encompass a relationship between x and both m and s. 
This new form of post-selection will be called "output-input post-selection". In 
this case, we can still find a way to avoid loopholes in the sense that £x remains 
the convex hull of linear Boolean functions. We now address loopholes in input 
post-selection. 
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4.2.2 Linear Input Post-selection in {n, 2, 2) tests 

We begin our discussion with a key result for setting post-selection. This result 
informs us of how to avoid loopholes and will lead to us describing a particular 
class of input post-selections. 

Theorem 17. The space Cx of LHV correlators is the convex hull of linear 
Boolean functions on x for input post- selections with sj = gj{x) if and only if 
every gj{x) is a linear Boolean function on x. 

Proof: First we prove the if statement. We only need to consider the extreme 
points of C corresponding to the deterministic linear Boolean functions /(s) on 
s, i.e. /(s) = ©"=1 ajSj b with Oj, b £ {0, 1}. If we post-select on sj = gj{x) 
being a linear Boolean function, then /(s) — )• /(x) = ©"^^^^^^(x) © b, which 
is again a linear Boolean function now on x. To prove the only if statement, 
if 5j(x) is a non-linear Boolean function, then extreme points of C producing 
/(x) = ©j=i o,jgj{x.) © b will in general be a non-linear Boolean function for all 
aj and b. □ 

From this result, £x will be defined by the facet Bell inequalities for the 
(|a;|,2,2) setting replacing p(l|s) with p(l|x). Returning to our example, the 
Mermin inequality (with replacing s with x) is the CHSH-like inequality defining 
a facet of £x with |x| = 2. This all occurs only if the input post-selection consists 
of gj{x) being a linear Boolean function. We now formally define this particular 
class of post-selections: 

Definition 12. Linear Input Post-selection (LI) is input post-selection but 
where all of the functions 5j(x) are linear Boolean functions on x. 



This post-selection can be seen to simulate nMBQC as described in section 3.4 
of chapter [3} Instead of pre-processing on inputs which are then distributed to n 
parties, we post-select on inputs satisfying the expressions that are described by 
the P matrices. Since LHV resources can only produce linear Boolean functions in 
nMBQC, then our post-selection simulates a model with the same computational 
power. Crucially both the pre-processing and post-selection is restricted to the 
linear Boolean functions. 

The connection to MBQC can now be extended by considering adaptivity. In 
adaptive MBQC, inputs, or measurement settings at each site are influenced by 
previous measurement outcomes. Translating this into a Bell test, the input 
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Sj is now a function h{mji\\/j' ^ j) of measurements outcomes rrij/ from other 
sites. Directly, this would assume communication between sites. However, if 
we post-select on inputs Sj satisfying this function h{mji\\/j' ^ j), then we can 
simulate this communication. We now discuss this form of post-selection and 
show, remarkably, that we can avoid loopholes. 

4.2.3 Linear Output-Input Post-selection in (n,2,2) tests 

We now extend input post-selection to consider functional relationships induced 
between the experimenter's variable x and m and s. In particular, for the jth 
site, Sj can be related to outcomes niji for j ^ j' . We introduce the notation 
m^-' to describe a (n — l)-length bit-string which is m but without the bit-value 
rrij. For example, if j = 1, then m^-' = {1712,1713, With this new piece of 

notation we now introduce a new form of post-selection, first studied by Hoban 
and Browne |Hoban2011b| . "Output-input post-selection" is now the same as 
input post-selection but the experimenter now accepts data when Sj = gj{'X-, m^-') 
instead of Sj = gj{x). Again, after the post-selection, the experimenter again 
calculates p(l|x) for each x. 

For this output-input post-selection, the space of all possible correlators is Px, 
the same as input post-selection. For LHV correlators, we describe the space of 
correlators after output-input post-selection as £^ . As an extension of the 
definition of a loophole for input post-selection, a loophole emerges if is 
larger than the convex hull of linear Boolean functions on x. We now show when 
loopholes in output-input post-selection can be avoided. 

Theorem 18. The space of LHV correlators is the convex hull of linear 

Boolean functions on x for output-input post-selections sj = gj{x,'mS^) if and 
only if every gj{x, mS^) is a linear Boolean function on x and 

Proof: First we prove the if statement. We recall that all deterministic LHV 
single-site maps can be written as mj = ajsj © f3j and we can take their convex 
combination. We assume that Uj and f3j is dependent on an LHV A but these 
variables are in no way correlated with the inputs s. Therefore, all extreme points 
of C from these deterministic maps result in ©"=1 ruj being a linear Boolean 
function on x and m^-' . 

If we do not assume that the values aj and /3j for all j are not correlated to 
s, there is a way in which this post-selection can allow correlations between bits 
from the LHV, aj, f3j and inputs s. We now demonstrate this method. We can 
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decompose a linear function (?j(x, m^-' ) as gj^\in\^) gj^\'K), i.e. in terms of the 
linear functions gj^\'m\^) and gj^\x.) on m^-^ and x respectively. The outcomes 
in m^-' contain information about A, but sj is random and uncorrelated to A, 
m and x. Therefore gj^\m\^) = gj^\x.) © sj means that gj^\m\^) is random 
and uncorrelated to g^^^ (x|^ These random bits sj play the role of the pad-bit 
in one-time pad cryptography which Shannon |Shannonl949j proved is perfectly 
secure for encrypting messages. 

We finally prove the only if statement. If gj{in\^ ,x) becomes non-linear then 
we can always produce this function /(x) = gj{m\^ ,x) as an output. Since values 
of m^-' can be made to be equal to values of x, there always exists a non-linear 
function in x if gj{m\^ ,x) is non-linear. □ 

We now call output-input post-selection where gj{x,mS^) is a linear Boolean 
function on x and m^^, Linear Output-Input Post-selection (LOI). With 
LOI, we can simulate signalling processes by making inputs dependent on outputs 
at other sites. But, we can also keep the space of correlators confined to the linear 
Boolean functions on x. This means that for all n > |x|, the space /^xm\j 
LOI is the convex hull of linear Boolean functions. The n- independence in 
the space of correlators is unusual given that in traditional Bell tests, the role of 
the number of parties is important. In some way, by considering |x|, we unify all 
possible multi-party Bell settings for n > |x|. 

With regards to quantum correlators, we have already indicated that there is 
an n-dependence in the example of the Mermin inequality. For n = |x| =2, 
the maximal violation of the CHSH inequality is 2^/2. After LI, for n = 3 and 
|x| =2, the same CHSH inequality in terms of p(l|x) has the maximal violation 
of 4. In the next subsection we will discuss the effect of LI and LOI upon the 
space of quantum correlators. 

4.2.4 Bipartite Quantum correlators under post-selection 

The space Qx of quantum correlators under LI needs to be specified for a partic- 
ular value of n, i.e. Q^. For LOI, the corresponding space of quantum correlators 
is for n number of parties. Since LOI includes all possible post-selections 

in LI, then necessarily C . 



^If (?^^'(x) = 0, the bit Sj does become correlated with other sites' measurements rrik and 
hence A but Sj will be uncorrelated to x. If g^j^\m^^) = 0, we recover LI post-selection. 
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We now focus on |x| = 2 as the smahest example of non-trivial behaviour 
of Since is strictly smaller than Px for |x| = 2, we can 

initially ask whether is larger than Q^? This turns out not to be the 

case as we now demonstrate. For this situation, the most general LOI possible 
involves post-selecting on the following relations being satisfied: si = xi © am2 
and S2 = X2 (B /3mi with a,f3 ^ {Oil}- When q = /3 = 0, we retrieve the 
standard, well-studied scenario. The two scenarios where a ^ f3 are equivalent 
up to changing of labels. If we consider the scenario where {a, /3} = {0, 1} then 
the probabilities 1 x) can be rewritten in terms of probabilities p{mi, m2\si, 82)- 

p(l|x) = ^ 6^'^®"^^p{mi,m2\si = xi, S2 = X2 (B mi) 

mi ,m2 

= p{0,l\si = Xl,S2 = X2) + p{l,0\si = Xl,S2 = X2 ® 1). (4.21) 



The correlator can be written in this way as p(m|s) is a non-signalling distri- 
bution. Any non-signalling probability distribution can be written as a con- 
vex combination of the vertices of MS. For the bipartite scenario there are 
two types of vertices: 1) local vertices where p{mi,m2\si, S2) 



for aj, bj E {0,1}; and 2) "non-local" vertices for p{mi,m2\si, S2 



■ \ for 

mi © m2 = (si © a){s2 © © c, and otherwise where a, 5, c G {0, 1}. In 
the former case, when si = xi, mi is either or 1 deterministically and so (4.21 ) 



must be 1 for a local vertex. For a non-local vertex, (4.21 ) takes any of the values 



{0, i, 1}, so for non-signalling distributions, every correlator of the form (4.21) is 



at most 1. 



To see if any correlators are outside of C^, we put the correlators in (4.21 ) into 
the CHSH inequality (and any in its symmetry group) to obtain 



p(0, 1|0,0) +p(l,0|0,0) +p{0, 1|0, 1) +p(l,0|0, 1)+ 
p(0,l|l,0)-p(l,0|l,0)-p(0,l|l,l)+p(l,0|l,l) < 2. 



(4.22) 



All bipartite non-local vertices satisfy 



p(0,l|l,0) -p(l,0|l,0) = -p(0,l|l,l)+p(l,0|l,l) = 0. 



(4.23) 



Therefore, all non-signalling probability distributions do not violate the CHSH 
inequality with LOI for a ^ f3. As a corollary, quantum correlators satisfy the 
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CHSH inequalitjQ 

Finally, for the scenario of LOI with a = f3 = 1, then mi©m2 = si(Bs2(Bxi(Bx2- 
The correlators p(l I x) are calculated when si©S2 = a;i®X2®l, sop(l|x) =p(l|x') 
for X = x' if xi © X2 = x[ ® Substituting these values of the correlators into 
all of the CHSH inequalities never yields a violation, as two of the correlators 
will cancej^ To summarise then, for LOI for aj^f3ora = l3=l, quantum 
correlators do not exceed the LHV polytope. Therefore the space of quantum 
correlators Ql.^\j = Ql_, with being Q for (2,2,2). 

4.2.5 Multipartite quantum correlators 

We have looked at the scenario when n = 2, we will now consider the space of 
quantum correlators Q^^\j for general n and |x|. Having shown that LIO has 
no impact on quantum correlators, we now show the opposite in the multipartite 
setting. That is, the space of quantum correlators under LIO can be larger than 
the space of quantum correlators under LI. We begin by considering the n = 3 
scenario and then use it to consider larger n for a particular |x|. 

Firstly, we observe that Q^^\j = "Px for |x| = 2. From the GHZ paradox, 
p{mi © 771-2 ffi w-s = l|x) = (5i^i^2ffii _ I gg^]^ map from the function 
Xi2;2 ® 1, to the other non- linear Boolean functions (xi © a)(x2 ® 6) ® c (with a, 
b, c £ {0, 1}) with relabelling of bit-values Xj. All vertices of can be achieved 
by quantum correlators for n = 3 and |x| = 2. 

We might ask whether quantum correlators can saturate the whole of for 
particular values of n and |x|? In the following lemma, quantum correlators for 
a given n and |x| can saturate the whole space "Px- In particular, any vertex of 
T^x corresponding to non-linear Boolean functions can be attained with quantum 
correlators for a particular n. We use the ri = 3, |x| = 2 case to demonstrate this 
fact. 

Lemma 19. For all \x\, Q^^\j contains the vertex p{l\x) = Sj^^-^ for f{x) = Hjii 
if n = 3{\x\ — 1). 

Proof: We prove this with an explicit LIO protocol. If we have n = 3y parties 
for y as some non-zero positive integer, and divide them into y sets of three neigh- 

^The inequalities in the CHSH inequality symmetry group are also not violated as we can map 
to all inequalities in this group via local re-labellings or an overall sign change, and we can 
also map from every non-local vertex of MS via the same operations 

^After the terms that are equal but have opposite sign pre-factors in the inequality cancel, the 
inequalities reduce to either 2p(l|x) < 2 or — 2p(l|x) < for a particular value of x 
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bouring parties in the following way {{l,2,3},...,{3y — 2,3y — 1, 3y}}. For each 
of these y sets {j, (j + 1), {j + 2)}, if inputs are sj, sq+i) and S(j+2) = Sj © sq+i); 
then we can have mj©m(j+i) ©m(j_|_2) = s^sq+i) with quantum correlator^ For 
j = 1, if the experimenter post-selects upon si = xi, S2 = X2 and S3 = xi © X2, 
then we have the situation for n = 3 and |x| = 2 discussed above. 

For j = 3fc + 1 with A;G{l,2,...,(y — 1)}, the experimenter post-selects data 
if Sj = 0/^1 m;, = © 1 and S(j+2) = ^jzl m © Xfc+2 © 1- For 

j = 4, this results in S4 = X1X2, S5 = X3 © 1 and sq = X1X2 © X3 © 1, and so 
J7T.4 © ms © mg = xiX2(x3 © 1), resulting in 0f=x ~ X1X2X3. Then by itera- 
tion, for k > 2, the above protocol results in 0^^^ = nf=i^^^- Therefore, if 
y = |x| — 1, the function /(x) = nl=i ^e achieved deterministically with 

n = 3(|x| - 1). □ 

If we consider LI, we know that for n = 2^^^ — 1, every Boolean function can be 
achieved deterministically. This is because LI can simulate nMBQC directly, and 
in nMBQC we need at most this number of parties to achieve all Boolean func- 
tions. Translated into the language of post-selection, = Vx for n = 2l''l — 1. 
What is more, we showed in Theorem 14, that to achieve p(l|x) = (Jj^^-j for 



/(x) = niii Xj with nMBQC, we require no fewer than n = 2l^l — 1 parties. For 
|x| > 3, 2l^l — 1 > 3(|x| — 1). This then gives us the following result. 

Theorem 20. Q^^\j can be be larger than for a fixed n and \x\. 

If we utilise LOI for a particular number n of parties, then we can get a larger 
violation of a Bell inequality with this LOI than with LI. Quantum correlators 
can be perceived to be "more non-local" if we process our measurement data in a 
particular way. Then the action of discarding data can not only allow classical, or 
LHV correlators to simulate quantum correlators (as in the detection loophole), 
but used to emphasize the non-classical aspect of quantum physics. 

The LOI can simulate a circuit where some outputs can affect some inputs. 
Traditionally, Boolean circuits have sequential gates so there is a temporal order 
of processes. In our post-selection, Boolean functions result from resources that 
are without temporal order or space-like separated. There has been a great deal 
of research into a field called Boolean circuit complexity and there is a natural 

^Corresponding to the maximal quantum violation 1 of the Mermin inequality 
-p(l|0,0,0) -p(l|0, 1,1) -p(l|l,0, 1) +p(l|l,l,0) < 0, equivalent to the original Mermin 
inequality with the symmetry operation being adding 1 (modulo 2) to the joint outcome 
mi © 7112 © ms. 
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overlap with discussion of this field to our discussion of LOI. Boolean circuit com- 
plexity asks how many fundamental operations or gates are required to perform 
any Boolean function (e.g. the AND and NOT gates) |Papadimitrioul994| . The 
application of Boolean circuit complexity results to LOI would be an interesting 
avenue of research. 

In this section we have shown that post-selection can be used to conceptu- 
ally change Bell tests. The post-selection described also establishes a link be- 
tween Bell tests and the full MBQC model described by Briegel and Raussendorf 
[Raussendorf2001| . Whilst LI simulates nMBQC, the adaptivity in MBQC can 
be simulated by LOI. With this post-selection, processing on measurement data 
utilises addition modulo 2, as with the classical computer in MBQC. Heuristi- 
cally, the Bell test with LOI is akin to a single round of measurements in MBQC, 
but we only accept the circuit if it corresponds to an adaptive circuit in MBQC; 
we discard the circuit otherwise. This is analogous to post-selected quantum 
state teleportation where we accept, post-select our system on the "correct" mea- 
surement outcome resulting in teleportation |Lloyd20TT] . 

Central to our discussion in this section has been the computational description 
of correlators. We then used this computational description to consider input 
and output-input post-selection. We can limit the computational implications 
of this post-selection if we restrict ourselves to measurement data being related 
by linear Boolean functions. Linear Boolean functions are associated with LHV 
correlators. If all processing on data consists of linear Boolean functions, the 
computational power of LHV correlators remains linear. However, for general 
{n, c, d) scenarios, LHV correlators are associated with n-partite linear functions. 
We show in the next section, that generalising LI and LOI to these more general 
scenarios can be very problematic. 



4.3 General settings and Input Post-selection 



The discussion in the previous two sections 4.1 and 4.2 of this chapter have been 
in the (n, 2, 2) scenario. We now consider generalisations of input post-selection 
to (n, d, d) scenarios where d is prime. We have demonstrated that we can avoid 
loopholes in the (n, 2, 2) scenario, is this true for all d? In order to address this 
question we need to generalise the approach developed in the previous section. 
We will introduce two natural generalisations of the post-selection in LI, and we 
show that it is not loophole free. Despite this, we will again give an indication 
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that the region of quantum correlators can be enlarged by post-selection. We 
now proceed to introduce the framework for input post-selection in the (n, d, d) 
scenario. 

|x| 

As before, the experimenter has some |x|-length digit string x G ZJ^ which 
they have chosen. He receives data m and s from ah n parties and then accepts 
this data if inputs sj are equal to some function gj{x.) on x. If this function is not 
satisfied by all Sj then the experimenter discards this data. Once the data has 

and produce 

d 



En 



been accepted by the experimenter they calculate k 
the correlator p(/c|x). 

The space of all possible correlators is T^x) the convex polytope of correlators 
p(/c|x) = '^j(x) ^'-'^ function /(x) : — )• Z^. For the trivial post-selection 
Sj = Xj where n = |x|, then the space of LHV correlators is the convex 
hull of n-partite linear functions on x. This space might be dependent on n, but 
this is implicitly assumed in our notation. As in the {n, 2, 2) case, we define a 
loophole as a form of input post-selection that results in Cx being larger than 
the convex hull of n-partite linear functions. 

So far, this framework for all (n, d, d) scenarios for prime d is almost identical 
to the (n, 2, 2) case. What is the generalisation of LI for this more general case? 
For (n, 2, 2), the linear Boolean functions on x are both n-partite linear functions 
and the addition modulo 2 of variables Xj (upto some additional constant). In 
the (n, d, d) scenario, functions consisting of sums of elements Xj modulo d are a 
subclass of all n-partite linear functions; we call these functions afRne functions 
on X. In the next subsection we will consider input post-selection for affine 
functions gj (x) . We will then consider the case where gj (x) is any n-partite linear 
function. In both cases, loopholes are introduced by the input post-selection. For 
all n-partite linear functions gj (x) , the space of LHV correlators is equal to 
for some n; this is not possible for the affine functions 5j(x). Finally we will 
briefly discuss the space of quantum correlators under input post-selection. 

4.3.1 Input Post-selection with Affine Functions 

We now consider input post-selection where (7j(x) are the affine functions. The 

,|x| 



affine functions can be written as /i(x) = b + Ylj=i '^j^j 



for a,, 6 G Zw. It can 
d ■' 

be readily seen that for d = 2, these functions are the linear Boolean function 
We now define the class of input post-selections for the affine functions. 



10 



Linear Boolean functions are often referred to as afSne Boolean functions. 
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Definition 13. Affine Input Post-selection (AI) is input post-selection where 
the experimenter accepts data when ah sj satisfy sj = h{x.) where /i(x) is an affine 
function on x. 

The space of LHV correlators under AI is written as C^^. We are now in a 
position to present the following result that shows that AI introduces loopholes. 
Whilst loopholes are introduced, is still smaller than the space of all possible 



correlators. This fact will be utilised in subsection 4.3.3 to highlight the space of 
quantum correlators for AI. 

Proposition 21. The space C^^ is larger than the convex hull of n-partite linear 
functions but smaller than Vx for n> \x\. 

Proof: We first use the results from chapter [2] to describe n-partite linear 
functions for c = d being prime: 



/(x) 



a 



n (d-l) ( {d-l) 

+ E E 1 - E (-1)' r 7 



j=i k=i 

n (d-l) 

with a G and G TL^ where 



z=o 



.^9 



(4.24) 



J d 



a 



n {d-l) 

j=i k=i 
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(4.25) 



and 



j,<i 



n {d-l) 

E E /^.^(-i)" 
j=i k=i 



11 (^-1) \l,d-{q+l) 



(4.26) 



Thus n-partite linear functions are the sum modulo d of powers of sj . In AI we 
calculate correlators p(/c|x) after post-selecting on sj = 6 + Yl^^li (^j^j 

L •' J d 

6 G Zrf. Therefore the extreme points of C corresponding to the n-partite linear 
functions get mapped to extreme points of with extreme points p(A;|x) = 
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corresponding to the functions 



/(x) 



a 



n |x| 

j=l q=l j= 



(4.27) 



which is a function consisting of multiplication between elements of x. This func- 
tion is not an n-partite linear function on x. The space of LHV correlators 
under AI is then not confined to the convex hull of n-partite linear functions on 

X. 

However, does not contain all vertices of "Px- In other words, the function 
in (4.27) is not equal to all functions / : ZJ. — )• Zd- We can demonstrate this by 



the example of the function /(x) = nliiC^j)^'^ ^^^^ cannot be produced 



by powers of ^ + X^fci ^j^j ■ Therefore C^^ y^Vx-O 



We have shown that AI is not a loophole-free form of post-selection but LHV 
correlators cannot saturate the whole space Px- This is somewhat analogous to 
discussion of the detection loophole, where for detection efficiency above some 
threshold, LHV correlators do not saturate the space of all possible correlators. 
In the subsequent subsection we will consider a more general class of input post- 
selections where gj(x) is now an n-partite linear function on x. As a corollary 
of the above result, these input post-selections are also not loophole- free. How- 
ever, in this new class of input post-selections, LHV correlators have greater 
computational expressiveness. 



4.3.2 Input Post-selection with n-Partite Linear Functions 

We now define input post-selection for n-partite linear functions 5j(x). As can 
be seen from this definition, this post-selection includes AI, and therefore is not 
loophole- free. 

Definition 14. n-Partite Linear Input Post-selection (PI) is input post- 
selection where the experimenter accepts data when all sj satisfy sj = h{x.) where 
h{x) is an n-partite linear function on x. 

Again, we can define the space of LHV correlators under PI as C^^. This space 
is thus larger than the convex hull of n-partite functions on x. In the following 
result we show that the space can be equal to Vx for particular instances of 
Ixl and n. 
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Proposition 22. The space jC^^ is Vx for a large enough nif\x\<{d—l) 



Proof: First we point out that for d being prime, any function /(x) can be 
written as a polynomial of elements Xj in the following way: 



/(x) 



E 



z62 



(4.28) 



|x| 

with G "Ld where z G ' are digit-strings. We now demonstrate that there are 
values otn = n' when we can achieve any of the polynomials niii(^j)^^ ' 

then we can take rfl^l sets of these n' parties; each set outputs Ozlljiil^j)^^ 
and we take the sum modulo d of all the sets outputs and as a result produce 
/(x). 

Now wc demonstrate that for n = n' parties we can produce the outcome 



En 
i=l 



niii(^i)^^ deterministically. First, we show that all polyno- 

id 

mial terms of length 2, i.e. rilii(^j)^^ with only 2 non-zero terms in y, can be 
produced and proceed by induction. The length 2 polynomials can be achieved if 
a party outputs mj = [(sj)^] ^ which is an n-partite linear function on s. We then 
post-select on sj satisfying the n-partite linear function on x in the following way 
Sj = [{xi)y^ + {xi')^^']^ where I and /' labels the 2 elements of y which are non- 
zero. After this post-selection mj = [(.x;)^^' + (x//)^^'' -|- 2(2;;)'^' (x;/)^''] ^ and if we 
have two other parties that each outputs rnj^i = [— Sj+i]^ and mj^2 = [~'Si+2](^ 
and post-select on sj+i = {xif'^'- and Sj+2 = (a^r)^^''- Then if we take the sum 
modulo d of these three outcomes we obtain mj©mj+i©mj+2 = [2(xi)J"(x;/)^'']^, 
which is a length 2 polynomial. We can repeat this process with q sets of three 
parties and take the sum modulo d of the joint outcomes of all sets to obtain 
[2q{xi)yi{xif)yi']-i^ = [{xi)y^{xi:)yi']^ such that [2q]^ = 1 as d is prime. 

For |x| = 3, we have another party outputting my = [{sj)'^]d and post-selecting 
on the n-partite linear function on x, Sjt = [(.xi)^i -|- (2:2)^^ + (a^s)^'^]^^- Thus 
we produce m^v = [{{xi)y^ + {x2)y^ + {x^Y'f] ^ = [3!(xi)fi (xsj^^xg)?'^^ + ...]^ 
where represents length 2 polynomials of x. The length 2 and 1 poly- 

nomials can be subtracted from this output from the j'th site as they can be 
produced by other parties as shown above, so that the joint outcome can pro- 
duce [3!(xi)^^(x2)^^(x3)^^]^. Again by taking q sets of parties that output this 
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in total and taking the joint outcome of all q sets produces 



(4.29) 



for [6g]^ = 1 as d is prime. 

We can repeat this process for |x| > 3, where a party outputs rriji/ = [(sj")'^']^ 
and we post-select upon Sj" = Yl^^Lii^k] 



Vk 



. This results in 



|x| 
k=l 



|x| 



ix|!n(^fcr + - 



k=l 



(4.30) 



where represents length (|x| — 1) polynomials of x which can be subtracted. 
Finally, again we can taking an arbitrary number of parties and the sum modulo d 
of the parties outputs will be IlfcliC^fc)^'' • This all applies when |x| < (d — 1), 
and so when this is satisfied, all functions on x can be achieved with large enough 
n. □ 



Therefore in the presence of data post-selection that is a natural generalisation 
of LI post-selection, not only do we avoid loopholes, but we can completely satu- 
rate the space of all possible correlators "Px- This truly highlights the uniqueness 
of the scenario with binary inputs and outputs at each site. We now discuss the 
effect of input post-selection upon quantum correlators. 



4.3.3 Quantum Correlators and Input Post-selection 

For LI and LIO, the space of LHV correlators was unaffected, but the space of 
quantum correlators was n-dependent and could completely saturate T^x- Since 
LHV correlators can also saturate the whole correlator space with PI, we briefly 
consider the effect of AI on quantum correlators. The space of quantum correla- 
tors under AI post-selection is Qx^. As with >Cx, there may be an n-dependence 
on the size of Qx^ but for brevity we will not make this explicit in our notation. 
The main result of this subsection is that for |x| = 2 and n = 3, Q^^ is larger 
than >Cx^. We demonstrate this by an example for d = 3. 

We have already shown in Proposition [2l] that the vertex of "Px corresponding 
to the function /(x) = [(^1X2)^ + l]g is not in C^^. Therefore, we can adapt the 
non-trivial Bell inequality (3.63) from subsection 3.3.3 in chapter [sjfor correlators 
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p(l|x): 

1 



g (lj(l|00) +p(l|01) +p(l|02) +p(110) +p(2|ll)) 



+ -(p(2|12)+p(l|20)+p(2|21)+p(2|22)) < |. (4.31) 



The right-hand-side is exactly the same as (3.63), as all of the n-partite lin- 
ear functions coincide with /(x) = [(xiX2)^ + l]3 for 8 out of 9 values of x. 
Therefore, for all functions not equal to /(x), this is the maximum overlap 
between functions. will be a convex polytope of functions not including 

/(x) = [(xiX2)^ + ijg, thus giving at most | (for the Bell expression) for each 
of its extreme points. As discussed in chapter [3j this inequality is not violated 
by quantum correlators for n = 2. However, this inequality can be violated by 
quantum correlators for n = 3 with AI if si = xi, S2 = X2 and S3 = [a;i+X2]3. We 
used the MBS approach to find a lower bound of ~ 0.9314 > | on the quantum 



violation of (4.31). 



Even in the presence of post-selection that introduces loopholes, the space of 
quantum correlators can be larger than the space of LHV correlators. Whilst 
not as dramatic as the effect that LI and LIO has on the quantum region, it 
is never-the-less interesting how "tactile" quantum correlators can be. That is, 
even if we imbue LHV correlators with more computational power (as with AI), 
quantum correlators can still have more computational expressiveness. It would 
be an interesting avenue of research to consider how quantum correlators are 
affected by non-loophole-free post-selection and whether their power can always 
be "boosted" by this post-selection. 



4.4 Chapter Summary 

The practical motivations of implementing Bell tests in the laboratory have mo- 
tivated the study of loopholes and how they emerge when we have to reject 
"imperfect" measurement data |Pearlel970| . In this chapter we have used the in- 
sight from considering Bell tests from a computational point-of-view to say how 
and why loopholes emerge. By post-selecting on measurement data only when 
we have successful detection, we establish a relationship between the inputs and 
local hidden variables. This relationship allows other parties to indirectly learn 
the inputs of other sites via this shared data. By modelling this behaviour we re- 
trieved the GM |Gargl987 bound on the necessary detection efficiency required 
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to establish a loophole-free violation of a Bell inequality. We then subsequently 
improved upon their bound by considering more parties. 

Our improvement on the GM bound is not as impressive as the improvement 
attained by Eberhard jEberhardl993] in the Clauser-Horne inequality setting. 
Eberhard's bound of | has been improved upon further [Vertesi2010j . this was 
a result of considering more measurement settings at each site. It would be 
interesting to consider the Bell inequalities on the full probability distribution 
for (n, 2, 2) for n > 2, and whether the detection efficiency can be lowered further 
in analogy to our results. 

Despite the issues associated with post-selection, there is a scenario where if 
we have perfect detections but the experimenter post-selects on measurement 
data by choice, we do not introduce loopholes. We associate LHV correlators 
with a limited computational expressiveness in the (n,2,2) scenario: only linear 
Boolean functions can be achieved. If we post-select on data but only in a way 
that does not introduce non-linear Boolean functions, we avoid loopholes and 
still allow the possibility for a violation of a Bell inequality. In fact, we can 
increase the amount of quantum violation for particular Bell inequalities if we 
utilise post-selection. 

However, we have also shown that the (n, 2, 2) scenario is unique in the respect 
of not introducing loopholes; if we allow a greater number of inputs and outputs 
at each site, loopholes can again emerge. For c = d > 2, LHV correlators can 
produce powers of its input, and this inherent multiplication can be used to sim- 
ulate all possible correlators. The ability to produce addition and multiplication 
modulo d for d being prime can be enough to produce any function / : — )• Z^. 
This has highlighted both how fragile Bell tests are in establishing a distinction 
between quantum and LHV correlators, and also how much descriptive power 
is accumulated by considering Bell tests from the computational point-of-view. 
Since we have shown the intimate link between correlators and functions on 
digit-strings, discussing functions has allowed to capture part of the picture of 
loopholes in Bell tests. 

Interestingly, the models of post-selection we have discussed for (n, 2, 2) involve 
the same level of data processing involved in MBQC. With LIO post-selection, 
we can simulate time-like separated processes such as adaptive MBQC circuits 
without introducing loopholes. Modelling signalling processes within Bell tests 
could lead to an insight into why we obtain improvements in information pro- 
cessing for quantum resources. We will summarise and consider some of these 



149 



ideas in the final chapter. 
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5 Summary and Outlook 



The Bell inequalities have dictated and continue to dictate much of the discussion 
about the nature of quantum mechanics. In this thesis we have suggested that a 
general framework for Bell tests has a computational aspect. This both allows us 
to use methods and ideas in computer science to say something about Bell tests 
and methods developed in Bell tests to say something about computation. This 
collaboration between applied and fundamental science is what drives a large 
part of quantum information science |Hardy2010| . The diversity of connections 
addressed in this thesis have been made between the CGLMP-type Bell tests 
and basic number theory (in the form of functions on cyclic groups); loopholes, 
post-selection and quantum computing; we also connected quantum computing 
to non-local games and WW Bell tests. 

In chapter [2| we outlined our approach to Bell tests, in particular looking at 
correlators: the expectation value of joint measurements. We showed that cor- 
relators can be associated with a notion of computation, that is functions on 
inputs. The calculation of a correlator maps raw statistical data into a stochas- 
tic map from inputs to a single output. This operational description allows to 
then think about information processing. This framework and description also 
has something to say about non-signalling theories and Svetlichny's model of 
correlations. 

The discussion of correlators in chapter [2] was mostly in terms of the vertex 
description of convex polytopes. In chapter [Sj we shifted to discussing the Bell 
inequality as defining the convex polytope of LHV correlators. We used the vertex 
description from chapter[2]to numerically calculate the linear inequalities, or facet 
Bell inequalities that define this polytope. However, it was only computationally 
feasible to find these inequalities for a relatively small number of settings. Given 
the hardness of the computational problem, we then just discussed non-trivial 
Bell inequalities, relaxing the need for the inequality to be facet-defining, but still 
potentially be violated. These non-trivial expressions then necessarily bound the 
space of LHV correlators to be smaller than the space of all possible correlators. 
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Non-trivial Bell inequalities are not only useful for bounding classical correla- 
tions, they have a natural interpretation in terms of non-local games. We looked 
at these non-local games in the many party, two-input, two-output scenario and 
showed that they have a concrete connection to Measurement-based Quantum 
Computing (MBQC). In particular, nMBQC, the class of non-adaptive circuits 
in the Raussendorf and Briegel model of MBQC can be shown to be inequivalent 
to a full quantum computer. However, within this nMBQC structure we still 
obtain natural generalisations of both the GHZ paradox and PR non-local box. 

An interesting aspect of the nMBQC model is that data processing by a classi- 
cal computer does not imbue LHV theories with any more computational power. 
In chapter [4| we applied this insight to data post-selection in Bell tests. We 
showed that post-selection in Bell tests, such as post-selecting on detecting out- 
comes in imperfect experiments, is problematic and introduces "loopholes". We 
used the computational insight from the rest of the thesis to show how the detec- 
tion loophole can emerge and then rederived the Garg-Mermin bound on detec- 
tion efficiency |Gargl98"7] . Throughout this discussion our computational per- 
spective drove the understanding of loopholes. 

After showing how in imperfect experimental Bell tests, loopholes can emerge, 
we turned to a different framework for data post-selection. We assume that we 
have perfect detection and data collection, but we post-select on inputs satisfying 
certain constraints. We showed that LHV correlators are unaffected in their com- 
putational expressiveness by this post-selection. We associate this conservation 
of computational power with the post-selection being "loophole-free". The notion 
of a loophole in both frameworks for post-selection is heuristically connected as 
allowing LHV correlators to have more computational expressiveness than just 
the linear Boolean functions. 

The post-selection in the second framework is loophole-free if we constrain its 
form. These constraints however can still allow the post-selection to simulate 
the processing a classical computer imposes on data sent to measurement sites in 
MBQC. Also we can simulate signalling processes with this post-selection. This 
offers a potentially fruitful way of viewing quantum protocols and processes that 
have time-like separated elements into a framework where processes are now in 
the context of space- like separated parties. All of these results were developed 
in the (n, 2, 2) scenario, and we showed that generalisations of these methods to 
other scenarios is problematic, thus highlighting the uniqueness of LHV expressed 
in terms of linear Boolean functions. 
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The work in this thesis is by no means a complete analysis of the role of 
computation in Bell tests, but perhaps strengthens the study of the relationship 
between the two. There is much work to be done still in understanding quantum 
correlations and whilst we have discovered new phenomena, the characterisation 
of quantum correlations remains broadly ill- understood. We have conjectured 
that all quantum correlators for the bipartite scenario can be captured by a 
particular set of quantum operators in the Navacues-Pironio-Acin hierarchy. It 
would be of great interest if this were true and if a similar behaviour occurred 
in the multipartite setting. This is an immediate problem raised by work in this 
thesis and worth pursuing as a continuation. 

In recent years, a significant amount of effort into classifying the geometric 
nature of non-signalling correlations (see e.g. |Pironio2011j ). We have shown 
that some of the extremal structure of the polytope of non-signalling theories 
can be revealed by the extremal structure of correlators. It would be interesting 
to see if there is a connection between correlators and the rest of the vertices of 
the non-signalling polytope. This picture is not clear as some of the non-LHV 
vertices of the non-signalling polytope for (3, 2, 2) do not violate any of the facet 
Bell inequalities for correlators in this setting. However, the generality of the 
correlator description in terms of computations could give a handle on some of 
these ideas. 

Continuing with the theme of characterising the full probability distribution 
instead of correlators, it would be interesting to study the effect of data post- 
selection on non-signalling resources. The difficulty in relating the inputs of 
parties to each other as we have done can allow LHV resources to achieve cor- 
relations that violate locality. Since in the correlator framework all single-site 
maps get mapped to a single output, this violation of locality has little or no 
effect. It would be interesting to allow resources that exploit "non- locality" in 
this way but still cannot produce something that quantum mechanics can pro- 
duce. This is akin to the detector loophole where the LHV region is enlarged by 
post-selection, but below a threshold detection efficiency, still is not large enough 
to simulate quantum correlators. 

Can our approach to correlators in terms of functions to applied to other issues 
in the study of Bell tests? An interesting potential avenue for further research 
could be the "monogamy of Bell correlations" jP awlowski20031 IToner2006j . This 
is similar to the "monogamy of entanglement']^ where we have three parties and 

^This expression is thought to originate with Charhe Bennett [Toner2006j . 
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if two parties are maximally entangled then the third party cannot be entangled 
with either of these two parties. It has been shown that Bell correlations behave 
in an analogous fashion where if two parties out of three violate a bipartite 
Bell inequality, then the correlations between either of these two parties and a 
third party cannot achieve a violation of the same inequality. This has been 
generalised to many parties with these parties divided into two overlapping sets 
[Pawlowski2003] . Can the language of functions, or computations explain that 
if one set of parties is trying to perform a computation, then by a satisfiability 
argument, the other set cannot produce this same function? 

Finally, since we have established a connection between MBQC and Bell tests, 
it would be interesting to simulate quantum computations such as, say, Shor's 
algorithm |Shorl99"7] and see if it violates a Bell inequality. In some sense then it 
could be seen that this computation cannot be resolved with a classical picture of 
the world, or it would highlight the non-classical aspects of this algorithm. Post- 
selection and quantum computation have been studied before by Scott Aaronson 
[Aaronson2005j . in a different format to our own framework. It was shown by 
Aaronson that quantum computation with post-selection of a different kind to 
ours is incredibly powerful. Speculatively, there may be some connection between 
our work and ideas in computational complexity. We have already made the 
connection to the class IQP |Shepherd2009| , this class may be amenable to the 
study of our Bell tests with post-selection. 

We hope to address the issues raised by this thesis in further research. We also 
hope that the work presented has produced the motivation to consider "device- 
independent" computing. This would be the ability to confirm that we have 
built something that uses quantum mechanics to compute but without knowing 
anything about the device. We have shown that the Bell inequality is a useful 
metric for quantum behaviour, in particular with regards to computation. More 
importantly, we hope that the work in this thesis can lead to new approaches of 
thinking about Bell tests, perhaps motivated by computation. 

N. David Mermin once quoted a "distinguished Princeton physicist" as saying 
[Merminl985j ."Anvbodv who's not bothered by Bell's theorem has to have rocks 
in his head." The Bell inequality has been a profound addition to science and we 
hope that the work in this thesis contributes to new aspects of its study. 
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